Accepted May 22, 2010 

Preprint typeset using I^T^X style cmulatcapj v. 11/10/09 



O 



in 

6 



> 

in 

\o 

in 

o 
o 



s 



THE CHANDRA SOURCE CATALOG 

Ian N. Evans, 1 Francis A. Primini, 1 Kenny J. Glotfelty, 1 Craig S. Anderson, 1 Nina R. Bonaventura, 1 

Judy C. Chen, 1 John E. Davis, 2 Stephen M. Doe, 1 Janet D. Evans, 1 Giuseppina Fabbiano, 1 Elizabeth C. Galle, 1 

Danny G. Gibbs II, 1 John D. Grier, 1 Roger M. Hain, 1 Diane M. Hall, 3 Peter N. Harbo, 1 Xiangqun (Helen) He, 1 

John C. Houck, 2 Margarita Karovska, 1 Vinay L. Kashyap, 1 Jennifer Lauer, 1 Michael L. McCollough, 1 

Jonathan C. McDowell, 1 Joseph B. Miller, 1 Arik W. Mitschang, 1 Douglas L. Morgan, 1 Amy E. Mossman, 1 

Joy S. Nichols, 1 Michael A. Nowak, 2 David A. Plummer, 1 Brian L. Refsdal, 1 Arnold H. Rots, 1 

Aneta Siemiginowska, 1 Beth A. Sundheim, 1 Michael S. Tibbetts, 1 David W. Van Stone, 1 Sherry L. Winkelman, 1 

AND PANAGOULA ZOGRAFOU 1 
Accepted May 22, 2010 

ABSTRACT 

The Chandra Source Catalog (CSC) is a general purpose virtual X-ray astrophysics facility that 
provides access to a carefully selected set of generally useful quantities for individual X-ray sources, 
and is designed to satisfy the needs of a broad-based group of scientists, including those who may 
be less familiar with astronomical data analysis in the X-ray regime. The first release of the CSC 
includes information about 94,676 distinct X-ray sources detected in a subset of public ACIS imaging 
observations from roughly the first eight years of the Chandra mission. This release of the catalog 
includes point and compact sources with observed spatial extents < 30". The catalog (1) provides 
access to the best estimates of the X-ray source properties for detected sources, with good scientific 
fidelity, and directly supports scientific analysis using the individual source data; (2) facilitates analysis 
of a wide range of statistical properties for classes of X-ray sources; and (3) provides efficient access to 
calibrated observational data and ancillary data products for individual X-ray sources, so that users 
can perform detailed further analysis using existing tools. The catalog includes real X-ray sources 
detected with flux estimates that are at least 3 times their estimated 1 a uncertainties in at least one 
energy band, while maintaining the number of spurious sources at a level of < 1 false source per field 
for a 100 ks observation. For each detected source, the CSC provides commonly tabulated quantities, 
including source position, extent, multi-band fluxes, hardness ratios, and variability statistics, derived 
from the observations in which the source is detected. In addition to these traditional catalog elements, 
for each X-ray source the CSC includes an extensive set of file-based data products that can be 
manipulated interactively, including source images, event lists, light curves, and spectra from each 
observation in which a source is detected. 
Subject headings: catalogs — X-rays: general 



1. INTRODUCTION 



Ever since Uhuru (jGiacconi et al.l 119711 ). X-ray as- 
tronomy missions have had a tradition of publishing 
catalogs of detected X-ray sources, and these cata- 
logs have provided the fundamental datasets used by 
numerous studies aimed at characterizing the proper- 
ties of the X-ray sky. While source catalogs are the 
primary data produ ct s from X-ray sky s u rveys (e.g., 
Giacconi et all Il972t iForman et all 119781 : lElvis et alJ 
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1992t iVqged 11993 iVoges et al.l ll999). the Einstein IPC 
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catalog ( Harris et al.l 11990) demonstrated the utility of 
catalogs of serendipitous sources identified in the fields 
of pointed- observation X-ray missio ns. More recent 
serendipitou s sou rce catalogs (e.g . . iGioia et aJJ 119901 : 
iWhite et al.HT99l lUeda et alJl2005t iWatson et aDl2008T l 
have further expanded the list of sources with X-ray data 
available for further analysis by the astronomical commu- 
nity. 



ievans@cfa.harvard.edu 

1 Smithsonian Astrophysical Observatory, 60 Garden Street, 
Cambridge, MA 02138 

2 MIT Kavli Institute for Astrophysics and Space Research, 
77 Massachusetts Avenue, Cambridge, MA 02139 

3 Northrop Grumman, 60 Garden Street, Cambridge, MA 
02138 



Source catalogs typically include a uniform reduction 
of the mission data. This provides a significant advan- 
tage for the general scientific community because it re- 
moves the need for end-users, who may be unfamiliar 
with the complexities of the particular mission and its 
instruments, to perform detailed reductions for each ob- 
servation and detected source. 

When compared to all previous and current X- 
ray missions, the Chandra X-ray Observatory (e.g., 
Wcisskopf ct al. 2000, 2002) breaks the resolution bar- 
rier with a sub-arcsecond on-axis point spread function 
(PSF). Launched in 1999, Chandra continues to provide a 
unique high spatial resolution view of the X-ray sky in the 
energy range from 0.1 to lOkeV, over a ~ 60-250 square 
arcminute field of view. The combination of excellent 
spatial resolution, a reasonable field of view, and low in- 
strumental background translate into a high detectable- 
source density, with low confusion and good astrometry. 
Chandra includes two instruments that record images of 
the X-ray s ky. The Advanced CCD Imaging Spectro m- 
eter (ACIS; iBautz et al.l U99l iGarmire et all [200l in- 
strument incorporates ten 1024 x 1024 pixel CCD detec- 
tors (any six of which can be active at one time) with 
an effective pixel size of ~ 0.5" on the sky, an energy 
resolution of order 110 eV at the Al-K edge (1.49 keV), 
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Fig. 1. — Distribution of CSC release 1.0 master sources on the 
sky, in Galactic coordinates. 



and a typical time resol ution of ~ 3.2 s. Th e High Res- 
olution Camera (HRC; IMurrav et al.l [2000h instrument 
consists of a pair of large format micro-channel plate de- 
tectors with a pixel size ~ 0.13" on the sky and a time 
resolution of ^15.6 /is, but with minimal energy resolu- 
tion. The wealth of information that can be extracted 
from identified serendipitous sources included in Chan- 
dra observations is a powerful and valuable resource for 
astronomy. 

The aim of the Chandra Source Catalog (CSC) is to 
disseminate this wealth of information by characteriz- 
ing the X-ray sky as seen by Chandra. While numer- 
ous other catalogs of X-ray sources de tected by Chandra 
may be found in the li t erature (e.g . , IZezas et al.l 120061 : 



Brassington et al 
2008; Muno et al 



2008; Roman o et al.l 120081; ILuo et al.l 
2009; Elv is et al.ll2009t) . the region of 



the sky or set of observations that comprise these cata- 
logs is restricted, and they are typically aimed at max- 
imizing specific scientific goals. In contrast, the CSC 
is intended to be an all-inclusive, uniformly processed 
dataset that can be utilized to address a wide range of 
scientific questions. The CSC is intended ultimately to 
comprise a definitive catalog of X-ray sources detected by 
Chandra, and is being made available to the astronomi- 
cal community in a series of increments with increasing 
capability over the next several years. 

The first release of the CSC was published in 2009 
March. This release includes information about 135,914 
source detections, corresponding to 94,676 distinct X- 
ray sources on the sky, extracted from a subset of public 
imaging observations obtained using the ACIS instru- 
ment during the first eight years of the Chandra mission. 
The distribution of release 1 sources on the sky is pre- 
sented in Figure 1. 

We expect that the CSC will be a highly valuable 
tool for many diverse scientific investigations. However, 
the catalog is constructed from pointed observations ob- 
tained using the Chandra X-ray Observatory, and is nei- 
ther all-sky nor uniform in depth. The first release of the 
catalog includes only point and compact sources, with 
observed extents < 30". Because of the difficulties in- 
herent in detecting highly extended sources and point 
and compact sources that lie close to them, and quan- 
tifying in a consistent and robust way the properties of 
such sources, we have chosen to exclude entire fields (or 
in some cases, individual ACIS CCDs) containing such 
sources from the first release of the CSC, as described 
in § 3.1. Therefore, the catalog does not include sources 
near some of the most famous Chandra targets, and there 
may be selection effects that restrict the source content 



of the catalog and which therefore may limit scientific 
studies that require unbiased source samples. 

The minimum flux significance threshold for a source to 
be included in the first release of the CSC is set conserva- 
tively, and corresponds typically to ^10 detected source 
photons (on-axis) in the broad energy band integrated 
over the total exposure time. This conservative thresh- 
old was chosen to maintain the spurious source rate at 
an acceptable level over the wide variety of Chandra ob- 
servations that are included in this release of the catalog. 
We expect to relax this criterion in future releases based 
on experience gained constructing the current release. 

A number of other Chandra catalogs do include sources 
with fewer net counts than the CSC. Such fainter thresh- 
olds are attainable typically either because of specific at- 
tributes of the observations included in those catalogs, 
or because of the assumptions made when constructing 
the catalog. 

As an exa mple of the former c ategory, the XBootes sur- 
vey catalog (|Kenter et al.ll2005l ) includes sources that are 
roughly a factor of two fainter than the CSC flux signif- 
icance threshold. That survey is constructed from short 
(5 ks) observations obtained in an area with low line-of- 
sight absorption. This results in a negligible background 
level that substantially simplifies source detection and 
enables identification of sources with very few counts. 
Some Chandra catalogs derived from observations with 
the range of exposu res comparable to those that com- 
prise the CSC (e .g., lElvis et all [20091: ILaird et al.l [20091 : 
iMuno et al.ll2009f ) also include fainter sources. However, 
in these cases the additional source fractions are in gen- 
eral not large, typically adding < 10% more sources be- 
low the CSC threshold, as described in detail in § 3.7.1. 

For other Chandra catalogs, visual review and valida- 
tion at the s ource level is a planned part of the p rocessing 
thread fe.g iKim et al.ll2007t l Muno et al.l l2009l). In i 



cases (e.g.. lBroos et al 
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2007), visual review may be used 



to adjust processing parameters for individual sources. 
Such manual steps are time-consuming, but enable lower 
significance levels to be achieved while maintaining an 
acceptable spurious source rate. In contrast, the CSC 
catalog construction process requires that the process- 
ing pipelines run on a wide range of observations with 
a minimum of manual intervention. The scope of the 
CSC is simply too large to require manual handling at 
the source level. We do not manually inspect individ- 
ual source detections, nor do we adjust source detection 
or processing parameters based on manual evaluation. 
Instead, the CSC uses a largely automated quality as- 
surance approach, as described in § 3.14. 

The sky coverage of the first catalog release (Fig. 2) 
totals ~ 320 square degrees, with coverage of ~ 310 
square degrees brighter than a 0.5-7.0 keV flux limit of 
1.0 x 10~ 13 ergcm~ 2 s _1 , decreasing to ^135 square de- 
grees brighter than 1.0 x 10~ 14 ergcm -2 s -1 , and ~ 6 
square degrees brighter than 1.0 x 10~ 15 ergcm~ 2 s _1 . 
These numbers will continue to grow as the Chandra mis- 
sion continues, with a 15 year prediction of the eventual 
sky coverage of the CSC of order 500 square degrees, or 
a little over 1% of the sky. 

In this paper we describe in detail the content and con- 
struction of release 1 of the CSC. However, where ap- 
propriate we also discuss in addition the steps required 
to process HRC instrument data used to construct re- 
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Fig. 2. — Sky coverage of release 1.0 of the CSC, in the ACIS 
broad energy band. The ordinate value is the total sky area in- 
cluded in the CSC that is sensitive to point sources with fluxes at 
least as large as the corresponding value on the abscissa. 



lease 1.1 of the catalog, since the differences in the algo- 
rithms are small. Release 1.1 of the catalog is scheduled 
for spring 2010. This paper is organized into 5 sections, 
including the introduction. In § 2, we present a descrip- 
tion of the catalog. This includes the catalog design 
goals, an outline of the general characteristics of Chandra 
data that are relevant to the catalog design, the organi- 
zation of the data within the catalog, approaches to data 
access, and an outline of the data content of the catalog. 
Section 3, which comprises the bulk of the paper, de- 
scribes in detail the methods used to extract the various 
source properties that are included in the catalog, with 
particular detail provided when the algorithms are new 
or have been adapted for use with Chandra data. A brief 
description of the principal statistical properties of the 
catalog sources is presented in § 4; this topic is treated 
comprehensively by F. A. Primini et al. (2010, in prepa- 
ration). Conclusions are presented in § 5. Finally, Ap- 
pendix A contains details of the algorithm used to match 
source detection from multiple overlapping observations, 
as well as the mathematical derivation of the multivariate 
optimal weighting formalism used for combining source 
position and positional uncertainty estimates from mul- 
tiple observations. 

2. CATALOG DESCRIPTION 
2.1. Design Goals 

The CSC is intended to be a general purpose virtual 
science facility, and provides simple access to a carefully 
selected set of generally useful quantities for individual 
sources or sets of sources matching user-specified search 
criteria. The catalog is designed to satisfy the needs of 
a broad-based group of scientists, including those who 
may be less familiar with astronomical data analysis in 
the X-ray regime, while at the same time providing more 
advanced data products suitable for use by astronomers 
familiar with Chandra data. 

The primary design goals for the CSC are to (1) al- 
low simple and quick access to the best estimates of the 
X-ray source properties for detected sources, with good 
scientific fidelity, and directly support scientific analy- 
sis using the individual source data; (2) facilitate analy- 
sis of a wide range of statistical properties for classes of 
X-ray sources; (3) provide efficient access to calibrated 



observational data and ancillary data products for in- 
dividual X-ray sources, so that users can perform de- 
tailed further analysis using existing tools such as those 
included in th e Chandra Interactive Analysis of Observa- 
tions (CIAO; iFruscione et al.l 120061 ) portable data anal- 
ysis package; and (4) include all real X-ray sources de- 
tected down to a predefined threshold level in all of the 
public Chandra datasets used to populate the catalog, 
while maintaining the number of spurious sources at an 
acceptable level. 

To achieve these goals, for each detected X-ray source 
the catalog records the source position and a detailed set 
of source properties, including commonly used quanti- 
ties such as multi-band aperture fluxes, cross-band hard- 
ness ratios, spectra, temporal variability information, 
and source extent estimates. In addition to these tradi- 
tional elements, the catalog includes file-based data prod- 
ucts that can be manipulated interactively by the user. 
The primary data products are photon event lists (e.g., 
lConroy|[l992l) . which record measures of the location, 
time of arrival, and energy of each detected photon event 
in a tabular format. Additional data products derived 
from the photon event list include images, light curves, 
and spectra for each source individually from each obser- 
vation in which a source is detected. The catalog release 
process is carefully controlled, and a detailed characteri- 
zation of the statistical properties of the catalog to a well 
defined, high level of reliability accompanies each release. 
Key properties evaluated as part of the statistical charac- 
terization include limiting sensitivity, completeness, false 
source rates, astrometric and photometric accuracy, and 
variability information. 

2.2. Data Characteristics 

Both ACIS and HRC cameras operate in a photon 
counting mode, and register individual X-ray photon 
events. For each photon event, the two dimensional po- 
sition of the event on the detector is recorded, together 
with the time of arrival and a measure of the energy of 
the event. In most operating modes, lists of detected 
events are recorded over the duration of an observation, 
typically between 1 ks and 160 ks, and are then teleme- 
tered to the ground for subsequent processing. 

To minimize the effect of bad detector pixels, and 
to avoid possible burn-in degradation of the camera by 
bright X-ray sources, the pointing direction of the tele- 
scope is normally constantly dithered in a Lissajous pat- 
tern, with a typical scale length of about 20" on the sky 
and a period of order 1 ks, while taking data. The mo- 
tion of the telescope is recorded via an "aspect camera" 
(jAldcroft et al.ll2000D that tracks the motion of a set of 
(usually 5) guide stars as a function of time during the 
observation. The coordinate transformation needed to 
remove the motion from the event (photon) positions is 
computed from the aspect camera data and applied dur- 
ing data processing. 

Breaking down the 4-dimensional X-ray data hyper- 
cube into spatial, spectral, and temporal axes provides a 
natural focus on the properties that may be of interest to 
the general user, but also identifies some of the complex- 
ities inherent in Chandra data that must be addressed 
by catalog construction and data analysis algorithms. 

Spatially, the Chandra PSF varies significantly with 
off-axis and azimuthal angle (with the former variation 
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Fig. 3. — Sample local Chandra model PSFs projected onto the 
ACIS detector pixel plane extracted from the CSC. The upper, 
middle, and lower sets of 4 images correspond to PSF models com- 
puted at the monochromatic effective energies of the ACIS hard, 
medium, and soft energy bands, respectively. From left to right, the 
images correspond to PSFs determined at off-axis angles 9 = 0', 5', 
10', and 15', respectively. The orientation and details of the PSF 
substructure varies with azimuthal angle, <j>. The image intensity 
scaling is proportional to the square root of the pixel flux. 



dominating), as well as with incident photon energy 
(Fig. 3). Close to the optical axis of the telescope, the 
PSF is approximately symmetric with a 50% enclosed 
energy fraction radius of order 0.3" over a wide range of 
energies, but at 15' off-axis the PSF is strongly energy- 
dependent, asymmetric, and significantly extended, with 
a 50% enclosed energy fraction radius of order 13" at 
1.5 keV. 

For the widely used ACIS detector, the instrumental 
spectral energy resolution is of order 100-200 eV, and 
depends on incident photon energy and location on the 
detector. Because the energy resolution is significantly 
lower than the typical energy width of the features and 
absorption edges that define the effective area of the tele- 
scope optics (and therefore the quantum efficiency of the 
telescope plus detector system), a full matrix formulation 
that considers the redistribution of source X-ray flux into 
the set of instrumental pulse height analyzer bins must 
be used when performing spectral analyses. This is in 
contrast to the more familiar scenario from many other 
wavebands, where the instrumental resolution is often 
much higher than the spectral variation of quantum effi- 
ciency, enabling the commonly used implicit assumption 
that the flux redistribution matrix is diagonal (and is 
therefore not considered explicitly). 

We note in passing that Chandra is equipped with a 
pair of transmission gratings that can be inserted into 
the optical path, and is therefore capable of performing 
high spectral resolution (slitless spectroscopy) observa- 
tions. However, such observations are not included in 
the current release of the CSC. 

Time domain analyses must consider the impact of 
spacecraft dither within an observation. Strong false 
variability signatures at the dither frequency can arise 
because of variations of the quantum efficiency over the 
detector, or because the source or background region 
dithers off the detector edge or across a gap between 
adjacent ACIS CCDs. Corrections for these effects, as 
well as for cosmic X-ray background flares that can be 
highly variable over periods of a few kiloseconds, must 
be applied when computing light-curves. The extremely 



Fig. 4. — Three separate observations that include the area 
surrounding the bright X-ray source CXO J162624. 0-242448 are 
shown. In each panel source detections from observations 00619, 
00635, and 00637 are identified in cyan, green, and red, respec- 
tively. Left: Observation 00619 (4.1 ks exposure). In this short 
exposure, only the bright source visible at an off-axis angle of 
~ 7.7'. The PSF is somewhat extended. Center: Observation 
00635 (100.7 ks exposure). In this deep exposure, the bright X-ray 
source is located ~ 15.6' off-axis in this deep exposure. The ex- 
tended PSF is clearly visible. Right: Observation 00637 (96.4 ks 
exposure). The bright source is located ~ 3.0' off-axis, and the 
combination of the compact PSF and long exposure resolves the 
region into 3 distinct source detections. 



low photon event rates common for many faint X-ray 
sources typically require time domain statistics to be 
evaluated using event arrival-time formulations instead 
of rate-based approaches. 

An additional level of complexity occurs because many 
astronomical sources of interest that will be included in 
the catalog are extremely faint. Rigorous application 
of Poisson counting statistics is required when deriving 
source properties and associated errors, separating X- 
ray analyses from many other wavebands where Gaussian 
statistics are typically assumed. 

2.3. Data Organization 

The tabulated properties included in the CSC are 
organized conceptually into two separate tables, the 
Source Observations Table and the Master Sources Table. 
Distinguishing between source detections (as identified 
within a single observation) and X-ray sources physically 
present on the sky is necessary because many sources are 
detected in multiple observations and at different off-axis 
angles (and therefore have different PSF extents). 

Each record included in the Source Observations Table 
tabulates properties derived from a source detection in 
a single observation. These entries also include pointers 
to the associated file-based data products that are in- 
cluded in the catalog, which are all observation-specific 
in the first catalog release. Each record in the Source 
Observations Table is further split internally into a set 
of source-specific data and a set of observation-specific, 
but source- independent, data. The latter are recorded 
once to avoid duplication. A description of the data 
columns recorded in the Source Observations Table for 
each source detection is provided in Table 1. 

Because of the dependence of the PSF extent with off- 
axis angle, multiple distinct sources detected on-axis in 
one observation may be detected as a single source if 
located far off- axis in a different observation (Fig. 4). 
During catalog processing, source detections from all ob- 
servations that overlap the same region of the sky are 
spatially matched to identify distinct X-ray sources. Es- 
timates of the tabulated properties for each distinct X- 
ray source are derived by combining the data extracted 
from all source detections and observations that can 
be uniquely associated, according to the algorithms de- 
scribed in § 3. The best estimates of the source properties 



Table 1. Source Observations Table Properties 



Property 



Multi- a Conf. b 
Band Lim. 



Units 



Description 



Observation Identification 



obsid 


No 


No 


obi 


No 


No 


Observation Target and Pointing 




targname 


No 


No 


ra.targ 


No 


No 


dec.targ 


No 


No 


ra.pnt 


No 


No 


dec.pnt 


No 


No 


rolLpnt 


No 


No 


ra_nom 


No 


No 


decnom 


No 


No 


rolljiom 


No 


No 


Observation Timing 






St 


.start 


No 


No 


st 


.stop 


No 


No 


st 


.elapse 


No 


No 


Rt 


_obs 


No 


No 


st 


.end 


No 


No 


gt 


.mjd.obs 


No 


No 


mjd_ref 


No 


No 


Instrument Configuratior 






instrument 


No 


No 


grating 


No 


No 


datamode 


No 


No 


readmode 


No 


No 


cxptimc 


No 


No 


cycle 


No 


No 


timingjnode 


No 


No 


Processing Information 






ascdsver 


No 


No 


caldbver 


No 


No 


cr 


late 


No 


No 



dog 



Observation identifier (Obsid) 
Observation interval number (Obi) 



Observation target name specified by observer 

Target position specified by observer, ICRS right ascension 

Target position specified by observer, ICRS declination 

Mean spacecraft pointing, ICRS right ascension 

Mean spacecraft pointing, ICRS declination 

Mean spacecraft pointing, roll angle 

Tangent plane reference position, ICRS right ascension 

Tangent plane reference position, ICRS declination 

Tangent plane reference position, roll angle 



Start time of valid data, MET (seconds since 1998 Jan 01 00:00:00 TT) 

Stop time of valid data, MET 

Total elapsed time of the observation (gtLstop — gtLstart) 

Start time of valid data, ISO 8601 format (yyyy-mm-ddThh:mm:ss) 

Start time of valid data, ISO 8601 format 

Start time of valid data, MJD 

MJD corresponding to s MET 

Instrument used for the observation, ACIS or HRC 

Transmission grating used for the observation, NONE, HETG, or LETG 

Instrument data mode used for the observation 

ACIS readout mode used for the observation 

ACIS CCD frame time 

ACIS readout cycle for the observation, P (primary) or S (secondary) for alternating 

exposure (interleaved) mode observations, or P for other ACIS modes 

HRC precision timing mode 

Software version used to create the Level 3 full-field event data file 

Calibration database version used to calibrate the Level 3 full-field event data file 

Creation date/time of the Level 3 full-field event data file, UTC 



= 

I 

CO 

c 
p 

l-j 

r, 
a 

a 

o 

:r 



Table 1 — Continued 



Property 



Multi- a Conf. b 
Band Lim. 



Units 



Description 



Observing Cycle 
ao 



No 



No 



Observation-Specific Source Identification 
region_id No No 

source_id No No 



Source Position 








ra 


Yes 


No 




dec 


Yes 


No 




galJ 


Yes 


No 


deg 


gal.b 


Yes 


No 


deg 


err_ellipse_rO 


Yes 


No 


arcsec 


err_ellipse_rl 


Yes 


No 


arcsec 


crr_ellipse_ang 


Yes 


No 


deg 


theta 


No 


No 


arcmin 


phi 


No 


No 


deg 


chipx 


No 


No 


pixels 


chipy 


No 


No 


pixels 


Source Significance 








flux .significance 


Yes 


No 





detect.significance Yes 

Source Codes and Flags d 
conf.code 
dither .warning Jlag 



edge.code 

extent.code 

rnulti.chip.code 

pileup.warning 
sat_src_flag 



No 



No 


No 


No 


No 


No 


No 


No 



Chandra observing cycle in which the observation was scheduled 



Unique identifier for each detected source region within a single observation 
Unique identifier for each distinct source component within a single source region 



Source position, ICRS right ascension 

Source position, ICRS declination 

Source position, Galactic longitude 

Source position, Galactic latitude 

Major radius of the 95% confidence level error ellipse 

Minor radius of the 95% confidence level error ellipse 

Position angle of the major axis of the 95% confidence level error ellipse 

Source aperture position, off-axis angle (6) 

Source aperture position, azimuthal angle (<fi) 

Detector Cartesian x position corresponding to (8, <j>) 

Detector Cartesian y position corresponding to (6, <j>) 



Significance of the source determined from the ratio of the source flux to the estimated 

error in the local background 

Significance of the source detection determined by the wavdetect algorithm 



Source regions overlap (source is confused; bit-coded value) 

Highest statistically significant peak in the power spectrum of the source region count 
rate occurs at the dither frequency of the observation or at a beat frequency of the dither 
frequency 

Source position or region dithered off a detector chip edge during the observation (bit- 
coded value) 

Deconvolved source extent is inconsistent with a point source at the 90% confidence level 
(bit-coded value) 

Source position or region dithered across multiple detector chips during the observation 
(bit-coded value) 

ACIS broad energy band count rate per pixel per CCD frame time (see Davis 2007a) 
Source is saturated; source properties are unreliable 



i 



Table 1 — Continued 



Property 



Multi- a Conf. b 
Band Lim. 



Units 



Description 



streak _src_flag 


No 


No 




var.code 


No 


No 




man_inc_flag 


No 


No 




man_reg_flag 


No 


No 




Source Extent 








mjr_axis_raw 


Yes 


No 


arcsec 


mnr_axis_raw 


Yes 


No 


arcsec 


pos_angle_raw 


Yes 


No 


deg 


mj r_axis_raw_err 


Yes 


No 


arcsec 


mnr_axis_raw_err 


Yes 


No 


arcsec 


pos_angle_raw_err 


Yes 


No 


deg 


psf_mjr_axis_raw 


Yes 


No 


arcsec 


psf_mnr_axis_raw 


Yes 


No 


arcsec 


psf_pos_angle_raw 


Yes 


No 


deg 


psf_mjr_axis_raw_err 


Yes 


No 


arcsec 


psf_mnr_axis_raw_err 


Yes 


No 


arcsec 


psf_pos_angle_raw_err 


Yes 


No 


deg 


major_axis 


Yes 


No 


arcsec 


minor_axis 


Yes 


No 


arcsec 


pos_angle 


Yes 


No 


deg 


major_axis_err 


Yes 


No 


arcsec 


minor_axis_err 


Yes 


No 


arcsec 


pos_angle_err 


Yes 


No 


deg 


Aperture Photometry 








ra_aper 


No 


No 




dec.aper 


No 


No 




mjr_axis_aper 


No 


No 


arcsec 


mnr_axis_aper 


No 


No 


arcsec 


pos_angle_aper 


No 


No 


deg 


area.aper 


No 


No 


arcsec 


mjr_axisl_aperbkg 


No 


No 


arcsec 


mnr_axisl_aperbkg 


No 


No 


arcsec 


mjr_axis2_aperbkg 


No 


No 


arcsec 


mnr_axis2_aperbkg 


No 


No 


arcsec 


pos_angle_aperbkg 


No 


No 


deg 



Source is detected on an ACIS readout streak; source properties may be affected 

Source displays flux variability during the observation (bit-coded value) 

Source was manually included in the catalog via human review 

Source region parameters (location, dimensions) were manually adjusted via human review 

1 a radius along the major axis of the ellipse defining the observed source extent 

1 a radius along the minor axis of the ellipse defining the observed source extent 

Position angle of the major axis of the ellipse defining the observed source extent 

Estimated error on the observed source extent major axis 

Estimated error on the observed source extent minor axis 

Estimated error on the observed source extent position angle 

1 a radius along the major axis of the ellipse defining the local model PSF extent 

1 a radius along the minor axis of the ellipse defining the local model PSF extent 

Position angle of the major axis of the ellipse defining the local model PSF extent 

Estimated error on the local model PSF extent major axis 

Estimated error on the local model PSF extent minor axis 

Estimated error on the local model PSF extent position angle 

1 a radius along the major axis of the ellipse defining the deconvolved source extent 

1 a radius along the minor axis of the ellipse defining the deconvolved source extent 

Position angle of the major axis of the ellipse defining the deconvolved source extent 

Estimated error on the deconvolved source extent major axis 

Estimated error on the deconvolved source extent minor axis 

Estimated error on the deconvolved source extent position angle 

Center of the source and background apertures, ICRS right ascension 

Center of the source and background apertures, ICRS declination 

Semi-major axis of the elliptical source region aperture 

Semi-minor axis of the elliptical source region aperture 

Position angle of the semi-major axis of the elliptical source region aperture 

Area of the modified elliptical source region aperture (includes corrections for exclusion 

regions due to overlapping sources) 

Semi-major axis of the inner ellipse of the annular background region aperture 

Semi-minor axis of the inner ellipse of the annular background region aperture 

Semi-major axis of the outer ellipse of the annular background region aperture 

Semi-minor axis of the outer ellipse of the annular background region aperture 

Position angle of the semi-major axes of the annular background region aperture 
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Table 1 — Continued 



Property 



Multi- a Conf. b 
Band Lim. 



Units 



Description 



area.aperbkg 


No 


No 


arcsec 2 




mj r_axis_aper90 


Yes 


No 


arcsec 




mnr_axis_aper90 


Yes 


No 


arcsec 




pos_angle_aper90 


Yes 


No 


deg 




area_aper90 


Yes 


No 


arcsec 2 




mjr_axisl_aper90bkg 


Yes 


No 


arcsec 




mnr_axisl_aper90bkg 


Yes 


No 


arcsec 




mjr_axis2_aper90bkg 


Yes 


No 


arcsec 




mnr_axis2_aper90bkg 


Yes 


No 


arcsec 




pos_angle_aper90bkg 


Yes 


No 


deg 




area_aper90bkg 


Yes 


No 


arcsec 2 




psf_frac_aper 


Yes 


No 






psf_frac_aperbkg 


Yes 


No 






psf_frac_aper90 


Yes 


No 






psf_frac_aper90bkg 


Yes 


No 






cnts.aper 


Yes 


No 


counts 




cnts.aperbkg 


Yes 


No 


counts 




src_cnts_aper 


Yes 


No 


counts 




src_rate_aper 


Yes 


Yes 


counts s _1 




photflux.aper 


Yes 


Yes 


photons cm -2 s" 


-1 


fiux.aper 


Yes 


Yes 


erg cm -2 s _1 




fiux_powlaw_aper 


Yes 


Yes 


ergcm s _1 




flux_bb_aper 


Yes 


Yes 


erg cm -2 s _1 




cnts_aper90 


Yes 


No 


counts 




cnts_aper90bkg 


Yes 


No 


counts 




src_cnts_aper90 


Yes 


No 


counts 




src_rate_aper90 


Yes 


Yes 


counts s _1 




photflux_aper90 


Yes 


Yes 


photons cm -2 s" 


-1 



Area of the modified annular background region aperture (includes corrections for exclu- 
sion regions due to overlapping sources) 

Semi-major axis of the elliptical Point Spread Function 90% Enclosed Counts Fraction 
aperture 

Semi-minor axis of the elliptical PSF 90% ECF aperture 

Position angle of the semi-major axis of the elliptical PSF 90% ECF aperture 
Area of the modified elliptical PSF 90% ECF aperture (includes corrections for exclusion 
regions due to overlapping sources) 

Semi-major axis of the inner ellipse of the annular PSF 90% ECF background aperture 
Semi-minor axis of the inner ellipse of the annular PSF 90% ECF background aperture 
Semi-major axis of the outer ellipse of the annular PSF 90% ECF background aperture 
Semi-minor axis of the outer ellipse of the annular PSF 90% ECF background aperture 
Position angle of the semi-major axes of the annular PSF 90% ECF background aperture 
Area of the modified annular PSF 90% ECF background region aperture (includes cor- 
rections for exclusion regions due to overlapping sources) 
Fraction of the PSF included in the modified source region aperture 
Fraction of the PSF included in the modified background region aperture 
Fraction of the PSF included in the modified PSF 90% ECF aperture 
Fraction of the PSF included in the modified PSF 90% ECF background aperture 
Total counts observed in the modified source region aperture 
Total counts observed in the modified background region aperture 
Aperture-corrected net counts inferred from the source region aperture 
Aperture-corrected net count rate inferred from the source region aperture 
Aperture-corrected net photon flux inferred from the source region aperture, calculated 
by counting X-ray events 

Aperture-corrected net energy flux inferred from the source region aperture, calculated 
by counting X-ray events 

Aperture-corrected net energy flux inferred from the source region aperture, calculated 
from an absorbed E~ 17 power-law spectral model 

Aperture-corrected net energy flux inferred from the source region aperture, calculated 
from an absorbed kT = 1.0 keV black-body spectral model 
Total counts observed in the modified PSF 90% ECF aperture 
Total counts observed in the modified PSF 90% ECF background region aperture 
Aperture-corrected net counts inferred from the PSF 90% ECF aperture 
Aperture-corrected net count rate inferred from the PSF 90% ECF aperture 
Aperture-corrected net photon flux inferred from the PSF 90% ECF aperture, calculated 
by counting X-ray events 
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Table 1 — Continued 



Property 



Multi- a Conf. b 
Band Lim. 



Units 



Description 



flux_aper90 Yes 

fiux_powlaw_aper90 Yes 

flux_bb_aper90 Yes 



Hardness Ratios 

hard_(x)(y) 

Spectral Model Fits f 



No 



Yes ergccm - 2 s — 1 

Yes erg cm -2 s -1 

Yes ergcm _2 s _1 

Yes 



flux.powlaw 


No 


Yes 


erg cm 2 s 1 


alpha 


No 


Yes 




nh.powlaw 


No 


Yes 


10 20 cm- 2 


powlaw_stat 


No 


No 




flux_bb 


No 


Yes 


erg cm -2 s _1 


kt 


No 


Yes 


keV 


nh.bb 


No 


Yes 


10 20 cm- 2 


bb_stat 


No 


No 




Temporal Variability 








var .index 


Yes 


No 




var_prob 


Yes 


No 




ks.prob 


Yes 


No 




kp.prob 


Yes 


No 




varjnean 


Yes 


No 


counts s _1 


var_sigma 


Yes 


No 


counts s _1 


var_min 


Yes 


No 


counts s _1 


var_max 


Yes 


No 


counts s _1 



Aperture-corrected net energy flux inferred from the PSF 90% ECF aperture, calculated 

by counting X-ray events 

Aperture-corrected net energy flux inferred from the PSF 90% ECF aperture, calculated 

from an absorbed E~ 1,7 power-law spectral model 

Aperture-corrected net energy flux inferred from the PSF 90% ECF aperture, calculated 

from an absorbed kT = 1.0 keV black-body spectral model 

Spectral hardness ratio measured between ACIS energy bands (x) and (y)\ hard_(x)(y) = 
(flux_aper_(x) — flux_aper_(y))/flux_aper_b 

Net integrated 0.5— lOkeV energy flux of the best power-law model spectral fit to the 

source region aperture PI spectrum 

Photon index (a, defined as F& oc E~ a ) of the best power-law model spectral fit to the 

source region aperture PI spectrum 

Total neutral Hydrogen column density, iVji, of the best power-law model spectral fit to 

the source region aperture PI spectrum 

X 2 (data variance) statistic per degree of freedom of the best power-law model spectral 

fit to the source region aperture PI spectrum 

Net integrated 0.5— lOkeV energy flux of the best black-body model spectral fit to the 

source region aperture PI spectrum 

Temperature (kT) of the best black-body model spectral fit to the source region aperture 

PI spectrum 

Total neutral Hydrogen column density, Wjj, of the best black-body model spectral fit to 

the source region aperture PI spectrum 

X 2 (data variance) statistic per degree of freedom of the best black-body model spectral 

fit to the source region aperture PI spectrum 

Gregory-Loredo variability index in the range [0, 10] 

Gregory-Loredo variability probability 

Kolmogorov-Smirnov variability probability 

Kuiper's test variability probability 

Flux variability mean value, calculated from an optimally-binned light curve 

Flux variability standard deviation, calculated from an optimally-binned light curve 

Flux variability minimum value, calculated from an optimally-binned light curve 

Flux variability maximum value, calculated from an optimally-binned light curve 
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Table 1 — Continued 



Property 



Multi- a Conf. b 
Band Lim. 



Units 



Description 



Source-Specific Observation Summary 
livetime No No 



detector 



No 



No 



Effective exposure time after applying the good time intervals and the deadtime correction 

factor 

Detector elements over which the background region bounding box dithers during the 

observation 



a Indicates that tabulated properties include separate entries for each energy band. The individual band entries are identified by the suffix "_(a;}", where (x) 
is one of the energy band designations listed in Table 4. 

b Indicates that tabulated properties include separate entries for ~ 68% lower and upper confidence limits. The data value is tabulated using the indicated 
property name, while the lower and upper confidence limits are identified by the suffixes "Jolim" and "_hilim," respectively. If a property includes both 
confidence limits and separate entries for each band, then the confidence limit suffix precedes the band designation suffix. 

c In the first release of the catalog, the source position error ellipse is approximated by a circle. 

d Translations for source codes that contain bit-coded values are presented in Table 8. 

e In the first release of the catalog, the deconvolved source extent ellipse is approximated by a circle. The deconvolved source extent is computed if at least 
6 counts are included in the source region aperture; the estimated error is computed if at least 15 counts are included in the deconvolved source extent ellipse. 

'Spectral model fits are only performed if the source has at least 150 net counts in the ACIS broad energy band. 
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Table 2. Master Sources Table Properties 



Property 



Multi- a Conf. b 
Band Lim. 



Units 



Description 



Source Name 








name 


No 


No 




Source Position 








ra 


No 


No 




dec 


No 


No 




galJ 


No 


No 


deg 


gal.b 


No 


No 


deg 


err_ellipse_rO 


No 


No 


arcsec 


err_ellipse_rl 


No 


No 


arcsec 


err_ellipse_ang 


No 


No 


deg 



Source Flux Significance (SNR) 
significance No 



No 



Source Flags 








confjlag 


No 


No 




extent_flag 


No 


No 




pilcupjlag 


No 


No 




sat_src_flag 


No 


No 




streak _src_flag 


No 


No 




varjlag 


No 


No 




var .inter _hard_flag 


No 


No 




man_inc_flag 


No 


No 




man_match_flag 


No 


No 




man_reg_flag 


No 


No 




Source Extcnt d 








major_axis 


Yes 


No 


arcsec 


minor_axis 


Yes 


No 


arcsec 


pos_angle 


Yes 


No 


deg 


major_axis_err 


Yes 


No 


arcsec 


minor_axis_err 


Yes 


No 


arcsec 


pos_angle_err 


Yes 


No 


deg 


Aperture Photometry 








photflux.aper 


Yes 


Yes 


photons cm 



Source name in the format "CXO Jhhmmss.s ± ddmmss" 



Source position, ICRS right ascension 

Source position, ICRS declination 

Source position, Galactic longitude 

Source position, Galactic latitude 

Major radius of the 95% confidence level error ellipse 

Minor radius of the 95% confidence level error ellipse 

Position angle of the major axis of the 95% confidence level error ellipse 



Highest source flux significance across all observations 

Source regions overlap (source is confused) 

Deconvolved source extent is inconsistent with a point source at the 90% confidence level 

ACIS pile-up fraction exceeds ~ 10% in all observations; source properties may be affected 

Source is saturated in all observations; source properties are unreliable 

Source is detected on an ACIS readout streak in all observations; source properties may 

be affected 

Source displays flux variability within an observation or between observations 

Source hardness ratios are statistically inconsistent across multiple observations 

Source was manually included in the catalog via human review 

Cross-observation source matching was performed manually via human review 

Source region parameters (location, dimensions) were manually adjusted via human review 

1 a radius along the major axis of the ellipse defining the deconvolved source extent 
1 a radius along the minor axis of the ellipse defining the deconvolved source extent 
Position angle of the major axis of the ellipse defining the deconvolved source extent 
Estimated error on the deconvolved source extent major axis 
Estimated error on the deconvolved source extent minor axis 
Estimated error on the deconvolved source extent position angle 

Aperture-corrected net photon flux inferred from the source region aperture, calculated 
by counting X-ray events 



9 

C 



a 



z 



Table 2 — Continued 



Property 



Multi- a Conf. b 
Band Lim. 



Units 



Description 



:_aper 


Yes 


Yes 


erg cm -2 s — * 


:_powlaw_aper 


Yes 


Yes 


erg cm -2 s — 1 


:_bb_aper 


Yes 


Yes 


erg cm -2 s — * 


tflux_aper90 


Yes 


Yes 


photons cm s 


:_aper90 


Yes 


Yes 


erg cm -2 s _1 


:_powlaw_aper90 


Yes 


Yes 


erg cm -2 s — 1 


:_bb_aper90 


Yes 


Yes 


erg cm -2 s — * 



Spectral Hardness Ratios 
hard_(x)(y) No 



Yes 



Model Spectral Fits e 








flux.powlaw 


No 


Yes 


erg cm 2 s * 


alpha 


No 


Yes 




nh.powlaw 


No 


Yes 


10 20 cm~ 2 


flux.bb 


No 


Yes 


erg cm -2 s — * 


kt 


No 


Yes 


keV 


nh_bb 


No 


Yes 


10 20 cm" 2 


nh.gal 


No 


No 


10 20 cm" 2 


Temporal Variability 








varjntrajndcx 


Yes 


No 





Aperture-corrected net energy flux inferred from the source region aperture, calculated 

by counting X-ray events 

Aperture-corrected net energy flux inferred from the source region aperture, calculated 

from an absorbed E~ 1,7 power-law spectral model 

Aperture-corrected net energy flux inferred from the source region aperture, calculated 

from an absorbed kt = 1.0 keV black-body spectral model 

Aperture-corrected net photon flux inferred from the Point Spread Function 90% Enclosed 

Counts Fraction aperture, calculated by counting X-ray events 

Aperture-corrected net energy flux inferred from the PSF 90% ECF aperture, calculated 

by counting X-ray events 

Aperture-corrected net energy flux inferred from the PSF 90% ECF aperture, calculated 

from an absorbed E~ 1,7 power-law spectral model 

Aperture-corrected net energy flux inferred from the PSF 90% ECF aperture, calculated 

from an absorbed kt = 1.0 keV black-body spectral model 

Spectral hardness ratio measured between ACIS energy bands (x) and (y); hard_(a;} (y) = 
(flux_aper_(:r) — flux_aper_(?/))/flux_aper_b 

Net integrated 0.5— lOkeV energy flux of the best power-law model spectral fit to the 
source region aperture PI spectrum 

Photon index (a, defined as F^ ocE~ a ) of the best power-law model spectral fit to the 
source region aperture PI spectrum 

Total neutral Hydrogen column density, TVh, of the best power-law model spectral fit to 
the source region aperture PI spectrum 

Net integrated 0.5-10 keV energy flux of the best black-body model spectral fit to the 
source region aperture PI spectrum 

Temperature (kT) of the best black-body model spectral fit to the source region aperture 
PI spectrum 

Total neutral Hydrogen column density, ./Vjj , of the best black-body model spectral fit to 
the source region aperture PI spectrum 

Galactic neutral Hydrogen column density, A r H(Gal) in the direction of the source deter- 
mined from Dickey & Lockman (1990) 

Intra-observation Gregory-Loredo variability index in the range [0, 10] (highest value 
across all observations) 
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Table 2 — Continued 



Property 



Multi- a Conf. b 
Band Lim. 



Units 



Description 



var_intra_prob 


Yes 


No 






ks_intra_prob 


Yes 


No 






kp_intra_prob 


Yes 


No 






var_intra_sigma 


Yes 


No 


counts s 1 




var_inter Judex 


Yes 


No 






var_inter_prob 


Yes 


No 






var .inter _sigma 


Yes 


No 


photons cm" 


- 2 s 


Observation Summary 1 










acis.num 


No 


No 






acis_hetg_num 


No 


No 






acis_letg_num 


No 


No 






acis.timc 


No 


No 


s 




acis_hetg_time 


No 


No 


s 




acis_letg_time 


No 


No 


s 




hrcjium 


No 


No 






hrc_letg_num 


No 


No 






hrc_hetg_num 


No 


No 






hrc_time 


No 


No 


s 




hrc_letg_time 


No 


No 


s 





Intra-observation Gregory- Loredo variability probability (highest value across all obser- 
vations) 

Intra-observation Kolmogorov-Smirnov variability probability (highest value across all 
observations) 

Intra-observation Kuiper's test variability probability (highest value across all observa- 
tions) 

Intra-observation flux variability standard deviation, calculated from an optimally-binned 
light curve (highest value across all observations) 

Inter-observation variability index in the range [0, 10]; indicates whether the source region 
photon flux is constant between observations 

Inter-observation variability probability, calculated from the \ 2 distribution of the photon 
fluxes of the individual observations 

Inter-observation flux variability standard deviation; the spread of the individual obser- 
vation photon fluxes about the error weighted mean 

Total number of ACIS imaging observations contributing to the Master Sources Table 
record of the source 

Total number of ACIS/HETG observations contributing to the Master Sources Table 
record of the source 

Total number of ACIS/LETG observations contributing to the Master Sources Table 
record of the source 

Total ACIS imaging exposure time (seconds of good time) for all ACIS imaging observa- 
tions contributing to the Master Sources Table record of the source 

Total ACIS/HETG observation exposure time (seconds of good time) for all ACIS/HETG 
observations contributing to the Master Sources Table record of the source 
Total ACIS/LETG observation exposure time (seconds of good time) for all ACIS/LETG 
observations contributing to the Master Sources Table record of the source 
Total number of HRC imaging observations contributing to the Master Sources Table 
record of the source 

Total number of HRC/LETG observations contributing to the Master Sources Table record 
of the source 

Total number of HRC/HETG observations contributing to the Master Sources Table 
record of the source 

Total HRC imaging exposure time (seconds of good time) for all HRC imaging observations 
contributing to the Master Sources Table record of the source 

Total HRC/LETG observation exposure time (seconds of good time) for all HRC/LETG 
observations contributing to the Master Sources Table record of the source 
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Table 2 — Continued 



Property 



Multi- a Conf. b 
Band Lim. 



Units 



Description 



hrc_hetg_time 



No 



No 



Total HRC/HETG observation exposure time (seconds of good time) for all HRC/HETG 
observations contributing to the Master Sources Table record of the source 



indicates that tabulated properties include separate entries for each energy band. The individual band entries are identified by the suffix "_(x}", where (x) 
is one of the energy band designations listed in Table 4. 

b Indicates that tabulated properties include separate entries for ~ 68% lower and upper confidence limits. The data value is tabulated using the indicated 
property name, while the lower and upper confidence limits are identified by the suffixes "Jolim" and "_hilim," respectively. If a property includes both 
confidence limits and separate entries for each band, then the confidence limit suffix precedes the band designation suffix. 

c In the first release of the catalog, the source position error ellipse is approximated by a circle. 

d In the first release of the catalog, the source extent ellipse is approximated by a circle. 

°Spectral model fits are only performed if the source has at least 150 net counts in the ACIS broad energy band. These properties are copied from the ACIS 
observation with the highest flux_significance in any energy band. 

f The first release of the catalog does not include observations obtained using the High Resolution Camera or observations obtained using the High or Low 
Energy Transmission Gratings. 
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Table 3. File-Based Data Products 



Data a 
Product 



File Name b 
Specifier 



Description 



Full-Field Data Products 

Event List evt3 

Image (cc)_img3 

Image (JPEG) <&)-img3 



Background Image (x)_bkgimg3 

Exposure Map (x)_exp3 

Sensitivity Map (x)_sens3 

Aspect Histogram ahst3 

Bad Pixel Map bpix3 

Field of View fov3 

Source Region Data Products' 1 

Source Region reg3 

Event List regevt3 

Image (a;)_regimg3 

Image (JPEG) (i)_regimg3 

Image 3-color (JPEG) reg3img3 

Exposure Map (x)_regexp3 

Point Spread Function (ic)_psf3 

Point Spread Function (JPEG) (x).psf3 

ARF arf3 

RMF rmf3 

PI Spectrum pha3 

Light Curve (i)_lc3 



Photon event list, with associated Good Time Intervals (GTIs), recorded in consecutive FITS Hierar- 
chical Data Units (HDUs) 

Per-energy-band background-subtracted, exposure corrected images (photons cm -2 s _1 ) 
Background-subtracted, exposure corrected images; 3-color JPEG encoding for ACIS observations 
(soft/medium/hard) energy bands color coded as (red/green/blue); monochromatic JPEG encoding 
for HRC observations 

Per-energy-band background images c (counts); includes high spatial frequency "readout streak" com- 
ponent for ACIS observations 

Per-energy-band exposure map images c (cm s photon -1 ) computed at the band monochromatic effec- 
tive energy 

Per-energy-band limiting sensitivity images c (photons cm -2 s _1 ); minimum photon flux per energy 
band required for a point source to satisfy the flux significance threshold necessary for inclusion in the 
catalog, as a function of position in the field of view 

Table of X, Y offsets (pixels) and roll-angle offsets (deg) vs. time due to spacecraft dither motion 
Detector bad pixel region-definitions, including observation-specific bad pixels 
Observation-specific sky field of view region-definitions 

Modified source region aperture and background region aperture region-definitions 
Photon event list, with associated GTIs recorded in consecutive FITS HDUs 
Per-energy-band background-subtracted, exposure corrected images (photons cm -2 s -1 ) 
Per-energy-band background-subtracted, exposure corrected images e ; monochromatic JPEG encoding 
(ACIS only) Exposure corrected image ; 3-color JPEG encoding for ACIS observations (soft/medium/ 
hard) energy bands color coded as (red/green/blue) 

Per-energy-band exposure map images (cm 2 s photon" 1 ) computed at the band monochromatic effec- 
tive energy 

Per-energy-band local model point spread function images computed at the band monochromatic ef- 
fective energy 

Per-energy-band local model point spread function images computed at the band monochromatic ef- 
fective energy 

Ancillary response file; table of telescope plus detector effective area (cm 2 ) vs. energy bin 
(ACIS-only) Detector redistribution matrix file 

(ACIS-only) Per-energy-band pulse-invariant source region aperture and background region aperture 
spectra, with associated GTIs, in consecutive FITS HDUs 
Per-energy-band optimally-binned light curve, computed using the Gregory- Loredo formalism 
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Table 3 — Continued 



Data 3, File Name b Description 

Product Specifier 



a All data products are recorded in FITS format, except where noted. Files are named (instr)f (obsid)-(obi)N(ver)-.[r(region-id)](specifier) . (ext), where 
(instr) is either ACIS or HRC, (obsid) is the five digit observation identifier, (obi) is the three digit observation interval number, (ver) is the file processing 
version number, (region.id) is the source region identifier, (specifier) is the file name specifier listed in the table, and (ext) is fits for FITS format files and 
jpg for JPEG format files; the region identifier element (enclosed in square brackets) is only present for source region data products. 

h (x) designates the energy band, one of b, s, m, or h for ACIS, and w for HRC; (b) is the image blocking factor. 

c Multiple blocked images are recorded in consecutive FITS HDUs; several blocking factors are used to bin multiple sky pixels into single image pixels, as 
described in § 3.4. 

d Source region data product images include the rectangular region, oriented along the cardinal directions, that bounds the background region aperture. 

"Source region data product images are blocked at the same blocking factor as the smallest corresponding full-field image that includes the background 
region aperture bounding box. ^ 

$ 
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Source Observations Table Entries 



CXO J 162623. 5-242439 



'> 



CXOJ162624.0-24244X 



D^ 



CXO J 162625.3-242444 
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Obsld 637 region 101 



Obsld 619 region 5 



Obsld 635 region 99 



Obsld 637 region 6 



Obsld 637 region 33 



Unique linka^t'.s 



^ ^ Ambiguous linkages 

Fig. 5. — Linkages between the Master Sources Table and the 
Source Observations Table entries for the source detections from 
Fig. 4 are depicted. The 3 source detections in observation 00637 
are uniquely identified with distinct X-ray sources on the sky, and 
will be associated with the corresponding master sources through 
"unique" linkages. Similarly, the single source detection (region 5) 
in observation 00619 is an unambiguous match to region 6 in obser- 
vation 00637, and so will also be associated with the same master 
source via a unique linkage. The confused detection, region 99 in 
observation 00635 overlaps the 3 source detections in observation 
00637, and so is associated with the corresponding master sources 
with "ambiguous" linkages. 



for each distinct X-ray source are recorded in the Mas- 
ter Sources Table. A description of the data columns 
recorded in the Master Sources Table for each source is 
provided in Tabic 2. 

Each distinct X-ray source is thus conceptually rep- 
resented in the catalog by a single entry in the Master 
Sources Table, and one or more associated entries in the 
Source Observations Table (one for each observation in 
which the source was detected). 

All of the tabulated properties included in both the 
Master Sources Table and the Source Observations Table 
can be queried by the user. Bi-directional links between 
the entries in the two tables are managed transparently 
by the database, so that the user can access all observa- 
tion data for a single source seamlessly. 

If a source detection included in the Source Observa- 
tions Table can be related unambiguously to a single X- 
ray source in the Master Sources Table, then the cor- 
responding table entries will be associated by "unique" 
linkages. Source detections included in the Source Obser- 
vations Table that cannot be related uniquely to a single 
X-ray source in the Master Source Table will have their 
entries associated by "ambiguous" linkages (Fig. 5). 

The data from ambiguous source detections are not 
used when computing the best estimates of the source 
properties included in the Master Sources Table. In the 
case of ACIS observations, source det ections for w hich 
the estimated photon pile- up fraction (|Davisll2007al) ex- 
ceeds ~ 10% will not be used if source detections in other 
ACIS observations do not exceed this threshold. 

Using the linkages between the entries in the two ta- 
bles, the user will nevertheless be able to identify all of 
the X-ray sources in the catalog that could be associ- 
ated with a specific detection in a single observation, 
and vice-versa. These linkages may be important, for 
example, when identifying candidate targets for follow- 
up studies based on a data signature that is only visible 



in the observation data for a confused source. 

2.4. Data Access 

The primary user tool for que rying the CSC is th e 
CSCview web-browser interface (jZografou et al. 2008), 
which can be accessed from the public catalog web-sitd3- 
The user can directly query any of the tabulated prop- 
erties included in either the Master Sources Table or the 
Source Observations Table, display the contents of an 
arbitrary set of properties for matching sources, and re- 
trieve any of the associated file-based data products for 
further analysis. CSCview provides a form-based data- 
mining interface, but also allows users to enter queries 
written using the Astron omical Data Query Language 
(ADQL; lOrtiz et all [20081, standard directly. Query re- 
sults can be viewed directly on the screen, or saved to 
a data file in multiple formats, including tab-delimited 
ASCII (which can be read directly by several commonly 
used astronomical applications) and International Vir- 
tual Observatory Alliance ^ ( IVOA) standard formats 
such as VOTable (|Ochsenbeinll2009l ). 

Automated access to query the catalog from data anal- 
ysis applications and scripts running on the user's home 
platform was identified as being needed for several sci- 
ence use cases. VO standard interface s, including Sim- 
ple Cone S earch dWilliams et al.l 120081) and Simple Im- 
age Access (jTody fc Plantel l2009). provide limited query 
and data access capabilities, while more sophisticated in- 
teractions are possible through a direct URL connection. 
Support for VO workflows using applicable standards will 
be added in the future as these standards stabilize. An 
interface that integrates catalog access with a visual sky 
browser provides a simple mechanism for visualizing the 
regions of the sky included in the catalog, and may also 
be particularly beneficial for education and public out- 
reach purposes. 

Since Chandra is an ongoing mission, the CSC includes 
a mechanism to permit newly released observations to be 
added to the catalog and be made visible to end users, 
while at the same time providing stable, well-defined, 
and statistically well-characterized released catalog ver- 
sions to the community. This is achieved by maintaining 
a revision history for each database table record, together 
with flags that establish whether catalog quality assur- 
ance and catalog inclusion criteria are met, and using 
distinct views of the catalog databases that utilize these 
metadata. 

"Catalog release views" provide access to each released 
version of the catalog, with the latest released version 
being the default. Catalog releases will be infrequent 
(no more than of order 1 per year) because of the con- 
trols built in to the release process, and because of the 
requirement that each release be accompanied by a de- 
tailed statistical characterization of the included source 
properties. Once data are included in a catalog release 
view, then they are frozen in that view, even if the source 
properties are revised or the source is deleted in a later 
catalog release. A source may be deleted if the detection 
is subsequently determined to be an artifact of the data 
or processing, but the most likely reason that a source is 



4 http://cxc.cfa.harvard.edu/csc/ 

5 http://www.ivoa.net/ 
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deleted from a later catalog release is that additional ob- 
servations included in the later release resolve the former 
detection into multiple distinct sources. 

"Database views" provide access to the catalog 
database, including any new content that may not be 
present in an existing catalog release. Because on-going 
processing is continually modifying the catalog database, 
tabulated data and file-based data products in a database 
view may be superseded at any time, and the statistical 
properties of the data are not guaranteed. 

We anticipate that users who require a stable, well- 
characterized dataset will choose primarily to access the 
catalog through the latest catalog release view. On the 
other hand, users who are interested in searching the 
latest data to identify sources with specific signatures 
for further study will likely use the latest database view. 

2.5. Data Content 

The first release of the CSC includes detected sources 
whose flux estimates are at least 3 times their estimated 
1 cr uncertainties, which typically corresponds to about 
10 net (source) counts on-axis and roughly 20-30 net 
counts off-axis, in at least one energy band. In this re- 
lease, multiple observations of the same field are not com- 
bined prior to source detection, so the flux significance 
criterion applies to each observation separately. 

For each source detected in an observation, the catalog 
includes approximately 120 tabulated properties. Most 
values have associated lower and upper confidence lim- 
its, and many are recorded in multiple energy bands. The 
total number of columns included in the Source Obser- 
vations Table (including all values and associated confi- 
dence limits for all energy bands) is 599. 

Roughly 60 master properties are tabulated for each 
distinct X-ray source on the sky, generated by combining 
measurements from multiple observations that include 
the source. Combining all values and associated confi- 
dence limits for all energy bands yields a total of 287 
columns included in the Master Sources Table. 

The tabulated source properties fall mostly into the 
following broad categories: source name, source posi- 
tions and position errors, estimates of the raw (mea- 
sured) extents of the source and the local point spread 
function, and the deconvolved source extents, aperture 
photometry fluxes and confidence intervals measured or 
inferred in several ways, spectral hardness ratios, power- 
law and thermal black-body spectral fits for bright (> 
150 net counts) sources, and several source variability 
measures (Gregory-Loredo, Kolmogorov-Smirnov, and 
Kuiper tests). 

Also included in the CSC are a number of file-based 
data products in formats suitable for further analysis in 
CIAO. These products, described in Table 3, include 
both full-field data products for each observation, and 
products specific to each detected observation-specific 
source region. 

The full-field data products include a "white-light" 
full- field photon event list, and multi-band expo- 
sure maps, background images, exposure-corrected and 
background-subtracted images, and limiting sensitivity 
maps. 

Source-specific data products include a white-light 
photon event list, the source and background region def- 
initions, a weighted ancillary response file (the time- 
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Fig. 6. — Chandra effective area and average ACIS quiescent 
background as a function of energy. The blue and cyan curves 
present the combined HRMA plus ACIS effective area at the ACIS- 
S aimpoint, with zero and the late 2009 level of focal plane contam- 
ination, respectively. The red and cyan curves show the effective 
area at the ACIS-I aimpoint, again with zero and late 2009 con- 
tamination, respectively. The dotted black line shows the quiescent 
background flux density on the ACIS S3 CCD, while the solid black 
line represents the ACIS 13 CCD background. The energies cor- 
responding to the edges of the CSC energy bands are shown by 
vertical dashed lines. 

averaged product of the combined telescope/instrument 
effective area and the detector quantum efficiency), 
multi-band exposure maps, images, model ray-trace PSF 
images, and optimally binned light-curves. Observations 
obtained using the ACIS instrument additionally include 
low-resolution (E/AE ~ 10-40, depending on incident 
photon energy and location on the array) source and 
background spectra and a weighted detector redistribu- 
tion matrix file (the probability matrix that maps photon 
energy to detector pulse height). 

2.5.1. Energy Bands 

The energy bands used to derive many CSC properties 
are defined in Table 4. The energy bands are chosen to 
optimize the detectability of X-ray sources while simulta- 
neously maximizing the discrimination between different 
spectral shapes on X-ray color-color diagrams. 

The effective area of the telescope (including both 
the Chandra High Resolution Mirror Assembly [HRMA] 
and the detectors) is shown in Figure 6 as a func- 
tion of energy, together with the average ACIS quies- 
cent b ackgrounds derived from blank sky observations 
(Markcvitch 2001a). The effective area is measured at 
the locations of the nominal "ACIS-S" aimpoint on the 
ACIS S3 CCD, and the nominal "ACIS-I" aimpoint on 
the ACIS 13 CCD. 

Where possible, the energy bands are chosen to avoid 
large changes of effective area within the central region 
of the band, since such variations degrade the accuracy 
of the monochromatic effective energy approximation de- 
scribed below. For example, the M-edge of the Iridium 
coating on the HRMA has significant structure in the 
~ 2.0-2.5 keV energy range that provides a natural break- 
point between the ACIS medium and hard energy bands. 
Note however, that large effective area variations are un- 
avoidable within the ACIS broad and soft energy bands 
and the HRC wide energy band. 

Weighting the effective area by the source spectral 
shape and integrating over the bandpass provides an in- 
dication of the relative detectability of a source in the 
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TABLE 4 
CSC Energy Bands 



Band 
Name Designation 


Energy a Monochromatic 51 
Range Energy 


Integrated Effective A 
ACIS-I ACIS-S 


rea b 
HRC-I 




ACIS Energy Bands 


Ultra-soft 

Soft 

Medium 

Hard 

Broad 


u 
s 

m 
h 

b 


0.2-0.5 0.4 
0.5-1.2 0.92 
1.2-2.0 1.56 
2.0-7.0 3.8 
0.5-7.0 2.3 


7.36-2.24 

216-155 

438-401 

1590-1580 

2240-2140 


68.7-23.0 

411-274 

539-493 

1680-1670 

2630-2440 




HRC Energy Band 


Wide 


w 


0.1-10 1.5 






605 



a keV. 

b keV cm 2 , computed at the ACIS-I, ACIS-S, and HRC-I aimpoints. For ACIS energy 
bands, the pair of values are the integrated effective area with zero focal plane con- 
tamination (first number) and with the late 2009 level of focal plane contamination 
(second number). 



energy band. Selecting energy band boundaries so that 
source detectability is roughly the same in different en- 
ergy bands more uniformly distributes Poisson errors 
across the bands, and so enhances detectability in the 
various bands. 

Several different source types were simulated when se- 
lecting energy bands. These included absorbed non- 
thermal (power-law) models with photon index val- 
ues ranging from 1 to 4, absorbed black-body mod- 
els with temperature varying from 20 eV to 2.0 keV, 
and absorbed, hot, opt ically-thin thermal plasma mod- 
els ([Raymond & Smith! fl977l) with kT = 0.25-4.0 keV. 
In all cases, the Hydrogen absorbing column was varied 
over the range 1.0 x 10 20 -1.0 x 10 22 cm~ 2 Detected X - 
ray spectra were simulated using PIMMS (Mukai 2009), 
and then folded through the bandpasses to construct 
synthetic X-ray color-color diagrams (see Fig. 7 for ex- 
ample color-color diagrams based on the final band 
parameters). Energy bands chosen to fill the color- 
color diagrams maximally provide the best discrimina- 
tion between different spectral sha pes. For detailed 
X-ray spectral-line modeling, the iRavmond fe Smith! 
(|1977h models have been superseded by more recent X- 
ray p lasma models (e.g.. iMewe et al1l!99 5: Smith et al.l 
l2001f ). However, since the radiated power of the newer 
models as a function of temperature is not significantly 
differe nt from the 1993 versions of the Raymon d fe Smith! 
(|1977l ) models used here, the latter are entirely adequate 
for the purpose of evaluating coverage of the X-ray color- 
color diagrams and the task is greatly simplified because 
o f their availabi l ity in PIMMS^f. 

I Grimm et al.1 (|2009[) compared broad band X-ray pho- 
tometry with accurate ACIS spectral fits and found that 
model-independent fluxes could be derived from the pho- 
tometry measurements to an accuracy of about 50% or 
better for a broad range of plausible spectra. They used 
similar but not identical energy bands to those adopted 
for the CSC, but did not use the method of deriving 
fluxes from individual photon energies emp loyed herein. 

Combining all of these considerations (McCollough 

6 The newer Mekal and APEC models are included in PIMMS 
v4.0. 



I2007f ) yields the following selection of energy bands for 
the CSC. 

The ACIS soft (s) energy band spans the energy range 
0.5-1.2 keV. The lower bound is a compromise that is set 
by several considerations. ACIS calibration uncertain- 
ties increase rapidly below 0.5 keV, so this establishes a 
fairly hard lower limit to avoid degrading source mea- 
surements in the energy band. As shown in Figure 8, 
below about 0.6 keV the background count rate begins 
to increase rapidly, while the integrated effective area 
rises very slowly resulting in few additional source counts. 
While pushing the band edge to higher energy will result 
in a lower background, the integrated effective area drops 
rapidly if the lower bound is raised above ~0.8keV, re- 
ducing the number of source counts collected in the band. 
We choose to set the lower bound equal to 0.5 keV since 
doing so enhances the detectability of super-soft sources, 
while not noticeably impacting measurements of other 
sources. The upper cutoff for the soft energy band is set 
equal to 1.2 keV, which balances the preference for uni- 
form integrated effective areas amongst the energy bands 
with the desire to maximize the area of X-ray color-color 
plot parameter space spanned by the simulations. 

The lower bound of the ACIS medium (m) energy band 
matches the upper bound of the soft energy band. We 
locate the upper band cutoff at 2.0 keV since this value 
tends to maximize the coverage of the X-ray color-color 
diagram. This value also moves the Iridium M-edge out 
of the sensitive medium band, and instead placing it im- 
mediately above the lower boundary of the ACIS hard 
(h) energy band. 

The high energy boundary of the latter band is set 
to 7.0 keV. This cutoff provides a good compromise be- 
tween maximizing integrated effective area and minimiz- 
ing total background counts (Fig. 8). Above 7.0 keV, the 
background rate increases rapidly at the ACIS-S, while 
below this energy the integrated effective area decreases 
rapidly at the ACIS-I aimpoint. Placing the hard en- 
ergy band cutoff at 7.0 keV also has the advantage that 
the Fe Kcv line is included in the band, allowing intense 
Fe line sources to be detected without compromising the 
measurement quality for typical catalog sources. 
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Fig. 7. — Synthetic color-color diagrams computed for the ACIS hard (h), medium (m), soft (s), and broad (b = h + m + s) energy bands. 
Left: Absorbed power-law models. The solid lines are lines of constant photon index Y = 1.0, 2.0, 3.0, and 4.0 (from right to left). The 
dashed lines are lines of constant neutral Hydrogen column densities JVh = 1.0 X 10 20 , 1.0 X 10 21 , 2.0x 10 21 , 5.0 X 10 21 , and 1.0 X 10 22 cm -2 
(from bottom to top). Right: Hot, optically thin thermal plasma models (Raymond & Smith 1977). The solid lines are lines of constant 
temperature kT = 0.25, 0.5, 1.0, 2.0, and 4.0keV (from left to right). The dashed lines are lines of constant neutral Hydrogen column 
densities 7V H = 1.0 X 10 20 , 1.0 X 10 21 , 2.0 X 10 21 , 1.4 X 10 22 , and 1.75 X 10 22 cm -2 (from bottom to top). Energy bands were chosen to 
optimize the ability to estimate spectral parameters from color-color diagrams. 



The ACIS broad (6) band covers the same energy range 
as the combined soft, medium, and hard bands, and 
therefore spans the energy range 0.5-7.0 keV. 

Simulations indicate that an additional energy band 
extending below 0.5 keV is beneficial for discriminat- 
ing super-soft X-ray sources in color-color plots. The 
ACIS front-illuminated CCDs have minimal quantum ef- 
ficiency below 0.3 keV, while the response of the back- 
illuminated CCDs extends down to ~ 0.1 keV. Hydro- 
carbon contamination is present on both the HRMA 
opti cs (|Jeriusl 120051) and the ACIS optical blocking fil- 



;i cs (IJei 
(Marsh 



ter ((Marshall et all 120041) . The latter reduces the effec- 
tive area at low energies, and enhances the depth of the 
Carbon K-edge. An ACIS ultra-soft (u) band covering 
0.2-0.5 keV is added to provide better discrimination of 
super-soft sources. Source detection is not performed 
in this energy band, because of the typical lower overall 
signal-to-noise ratio (SNR) and the resulting enhanced 
false-source rate. 

Finally, since the HRC (particularly HRC-I) has min- 
imal spectral resolution, a single wide (w) band that 
includes essentially the entire pulse height spectrum 
(specifically, PI values : 254), roughly equivalent to 
0.1-10 keV, is used for HRC observations. 

While bands in these general energy ranges give the 
best balance of count rate and spectral discrimination, 
our simulations indicate that the exact choice of band 
boundary energies is not critical at the 10% level. 

2.5.2. Band Effective Energies 

In principle, the variations of HRMA effective area, 
detector quantum efficiency, and (for ACIS) focal plane 
contamination, with energy imply that energy-dependent 
data products such as exposure maps or PSFs should be 
constructed by integrating the source spectrum over the 
energy band. This approach would be both extremely 
time-consuming, and require knowledge of the source 
spectrum that is typically not available a priori. In prac- 
tice, a monochromatic effective energy is chosen for each 
energy band t o be used to const ruct energy dependent 
data products (|McCorloughll2007l ). 

The monochromatic effective energy for each band is 



determined using the relation 



E, 



off 



/ dE EA{E)Q{E)C{E)S{E) 
J dE A(E)Q(E)C(E)S{E) ' 



(1) 



where E is the energy, A is the effective area of the 
HRMA, Q is the detector quantum efficiency, C is the 
reduction in transmission due to focal plane contamina- 
tion, S is a power-law spectral weighting function of the 
form (E/Eo)~ a , and the integral is performed over the 
energy band. 

The monochromatic effective energies for each energy 
band were calculated for sources located at the ACIS-I 
and ACIS-S aimpoints, and also for the nominal aimpoint 
on the HRC-I detector. Since the CSC is constructed 
from observations acquired throughout the Chandra mis- 
sion, ACIS focal plane contamination models with both 
zero contamination (appropriate for observations ob- 
tained early in the mission) and the contamination level 
current as of late 2009 were employed. Power-law spec- 
tral weighting functions with a varying from 0.0 to 2.0 
were used. Setting a = 1 gives a spectral weighting func- 
tion that approximates an absorbed T = 1.7 power-law 
spectrum, and the limits for a were chosen to span the 
typical range of values determined from fits to a canon- 
ical subset of Chandra datasets. The remaining param- 
eters in equation (fTJ are ext racted from the Chandra 
calibration dat abase (CalDB; iGeorge fc Corcoran! 120051 : 
IGraessle et al.lr2~006D . The monochromatic effective ener- 
gies for ACIS were chosen to be the approximate arith- 
metic means of the a = 1 values derived for the ACIS- 
I and ACIS-S aimpoints, with zero and late 2009 focal 
plane contamination. For ACIS energy bands other than 
the the broad band, the monochromatic effective ener- 
gies computed for a single value of a all agree within 
< 0.1 keV. The dependence on a is similarly small, ex- 
cept for the hard energy band, where varying a from 0.0 
to 2.0 changes the monochromatic effective energy from 
— 4.2 keV to ~ 3.4 keV. For the ACIS broad energy band, 
the agreement between the different models for a single 
value of a is ~±0.3keV. However, for this band the de- 
pendence on a is more significant, varying from ^3.3 keV 
for a = 0.0 to ~1.6keV for a = 2.0. The monochromatic 
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Fig. 8. — Left: Plot shows how the ACIS soft (s) energy band integrated effective area and background count rate per CCD vary with 
the choice of lower bound for the energy band. Markers for different lower bounds are shown. The individual curves show the relationship 
at the ACIS-S and ACIS-I aimpoints, and with zero and the late 2009 level of focal plane contamination, as follows. Solid line: ACIS-S 
aimpoint, no contamination; dotted line: ACIS-S aimpoint, late 2009 contamination; dashed line: ACIS-I aimpoint, no contamination; 
dash-dotted line: ACIS-I aimpoint, late 2009 contamination. Right: Plot shows how the ACIS hard (h) energy band integrated effective 
area and background count rate per CCD vary with the choice of upper bound for the energy band. Markers and line styles are the same 
as in the left panel. 



effective energies used to construct the CSC are reported 
in Table 4. 

Although the use of a single monochromatic effective 
energy for each energy band simplifies data analysis by 
removing the dependence on the source spectrum, some 
error will be introduced for sources that have either ex- 
tremely soft or extremely hard spectra compared to the 
canonical a — 1.0 power-law spectral weighting function. 
Knowledge of the expected magnitude of the error that 
may be introduced is helpful when evaluating catalog 
properties. 

For both the ACIS medium and hard energy bands, 
neither extremely soft nor extremely hard source spectra 
induce variations in exposure map levels that are greater 
than ~10%, so photometric errors due to source spectral 
shape should not exceed this value. In the ACIS soft 
energy band, very soft spectra may produce deviations 
of order 5-20%, with the largest excursions expected for 
the front-illuminated CCDs. These differences increase 
to ~ 15-35% for the ACIS ultra-soft energy band, with 
the largest values once again associated with the front- 
illuminated CCDs. For all of the ACIS narrow energy 
bands, the errors induced by extremely hard spectra are 
much smaller than those caused by extremely soft spec- 
tra. The presence of the Iridium edge and the large en- 
ergy ranges included in the ACIS broad and HRC wide 
energy bands may produce significantly larger variations 
for extreme spectral shapes. Very soft spectra can alter 
exposure map values by ~ 65-90% in the ACIS broad 
energy band, although there is little impact in the HRC 
wide energy band. Conversely, extremely hard spectra 
may induce changes up to ~ 70% in the HRC wide energy 
band, and ~ 25-30% in the ACIS broad energy band. As 
described in § 4.4, model-based statistical characteriza- 
tion of CSC source fluxes (F. A. Primini et al. 2010, in 
preparation) produces results that are generally consis- 
tent with these expectation, with the exception that flux 
errors in the ACIS broad energy band appear to be ~ 10% 
for most sources. 

When computing fluxes for point sources, an aperture 
correction is applied to compensate for the fraction of 
the PSF that is not included in the aperture. Since the 



extent of the Chandra PSF varies with energy, using a 
monochromatic effective energy can introduce a flux er- 
ror because the energy dependence of the PSF fraction is 
not considered. This error can be bounded by a post facto 
comparison of PSF fractions for catalog source detections 
in the 5 ACIS energy bands. The majority of variations 
between energy bands fall in the range 4-8%, with 90% 
of source detections showing < 10% differences. These 
values represent an upper bound on the error introduced 
within an energy band by the use of a monochromatic 
effective energy. 

2.5.3. Coordinate Systems and Image Binning 

As described previously, X-ray photon event data are 
recorded in the form of a photon event list. The pixel 
position on the detector where a photon was detected is 
recorded in the "chip" pixel coordinate system. Event 
positions are remapped to celestial coordinates t hroug h 
a series of transforms, as described bv iMcDowelll ([2001). 
The first step in this process remaps chip coordinates 
to a uniform real-valued virtual "detector" pixel space 
by applying corrections for the measured detector ge- 
ometry, and instrumental and telescope optical system 
distortions recorded in the CalDB. Subsequent applica- 
tion of the time-dependent aspect solution removes the 
spacecraft dither motion, and maps the event positions 
to a uniform virtual "sky" pixel plane. The latter has 
the same pixel scale as the original instrumental pixels, 
but is oriented with North up (+V direction) and is cen- 
tered at the celestial coordinates of the tangent plane 
position for the observation. As an aid to users, the lo- 
cation of each event in each coordinate system is recorded 
in the calibrated photon event list. A simple unrotated 
world coordinate system transform maps sky positions 
to ICRS right ascension and declination by applying the 
plate scale calibration to the difference between the posi- 
tion of the source and a fiducial point, which is typically 
the optical axis of the telescope. The celestial coordi- 
nates of the fiducial point are determined from the aspect 
solution. 

Sky images are constructed from the calibrated photon 
event lists by binning photon positions in sky coordinates 
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Fig. 9. — High-level flow diagram depicting the steps used to process each observation's full ficld-of-view, detect sources, and extract the 
physical properties for each detected source. The references identify the relevant sections of the text that describe in detail the methods 
used. 



into a regular, rectangular image pixel grid. A conse- 
quence of constructing images by binning in sky coordi- 
nates is that Chandra images are always oriented with 
North up. The choice of image blocking factor deter- 
mines the number of sky pixels that are binned into a sin- 
gle image pixel. Full field image products associated with 
ACIS observations are constructed by binning the area 
covered by the inner 2048 x 2048 sky pixels at single pixel 
resolution, then binning the inner 4096 x 4096 sky pix- 
els at block 2, and finally binning the entire 8192 x 8192 
sky pixel field at block 4. The corresponding blocking 
factors for HRC-I observations are 2, 5, and 12. Using 
a constant blocked image size of 2048 x 2048 pixels re- 
duces overall data volume, while preserving resolution in 
the outer areas of the field of view where the PSF size is 
significantly larger than a single pixel. 

3. CATALOG GENERATION 

In this section, we describe in detail the methods used 
to derive the X-ray source properties that are included in 
the CSC, with particular detail provided in cases where 
the algorithms are new or have been newly adapted for 
use with Chandra data. 

The principal steps necessary to generate the cata- 
log consist of processing the data for each observation's 
full field-of-view, detecting X-ray sources included within 
that field of view, and then extracting the spatial, pho- 
tometric, spectral, and temporal properties of each de- 
tected source. Figure 9 is a depiction of the high-level 
flow used to perform these steps. In the figure, each block 
references the section of the text that describes in detail 
the methods used. The physical properties associated 
with each source detection are recorded as a separate 
row in the catalog Source Observations Table. 

Once the source detections from each observation have 
been evaluated, they are correlated with source detec- 
tions from all other spatially overlapping observations to 
identify distinct X-ray sources on the sky. The steps re- 
quired to perform the source cross-matching, and then 



combine the data from multiple observations of a sin- 
gle source to evaluate the source's properties, follow a 
similar flow to the one presented in Figure 10. Many of 
the elements that comprise the second flow are built on 
the foundations developed for the related steps from the 
first flow. For convenience and continuity of notation, 
the former are described in the same text sections as the 
latter. The properties for each distinct X-ray source are 
included as a separate row in the catalog Master Sources 
Table. 

Data processing for release 1 of the CSC was performed 
using versions 3.0-3. 0.7 of the Chandra X-ray Center 
data system (CXCDS; Eva ns et al]l2006al lbl) catalog pro- 
cessing system ("CAT"), with calibration data extracted 
from CalDB version 3.5.0. The observation recalibration 
steps included in CAT3.0 correspond approximately with 
those included CIAO 4.0. In several cases, programs de- 
veloped for CAT3.0 to evaluate source properties have 
been repackaged with new interfaces for interactive use 
in subsequent CIAO releases (see Table 5). 

3.1. Observation Selection 

While the CSC ultimately aims to be a comprehen- 
sive catalog of X-ray sources detected by Chandra, all 
of the functionality required to achieve that goal are not 
included in the release 1 processing system. A set of 
pre-filters is used to limit the data content to the set of 
observations that the catalog processing system is capa- 
ble of handling. 

For release 1, only public ACIS "timed-exposure" read- 
out mode imaging observations obtained using either the 
"faint," "very faint," or "faint with bias" datamodes are 
included. ACIS observations that are obtained using 
CCD subarrays with < 128 rows are also excluded, be- 
cause there are too few rows to ensure that source-free 
regions can be identified reliably when constructing the 
high spatial frequency background map. HRC-I imag- 
ing mode observations are not included in release 1 of 
the catalog, but are included in incremental release 1.1. 
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Fig. 10. — As Fig. 9, except that the steps used to cross-match source detections, and combine data from multiple observations to evaluate 
source properties, are shown. 



HRC-S observations are excluded because of the pres- 
ence of background features associated with the edges of 
"T" -shaped energy-suppression filter regions that form 
part of the UV/ion-shield. Observations of solar system 
objects are not included in the CSC. 

All observations included in the CSC must have been 
processed using the standard data processing pipelines 
included in version 7.6.7, or later, of the CXCDS. This 
version of the data system was used to perform the most 
recent bulk reprocessing of Chandra data, and includes 
revisions to the pipelines that compute the aspect so- 
lution that is used to correct for the spacecraft dither 
motion and register the source events on the sky. Ob- 
servations must have successfully passed the "validation 
and verification" (quality assurance) checks that are per- 
formed upon completion of standard data processing. 

The largest scale lengths used to detect sources to 
be included in the CSC have angular extents ~ 30". 
Sources with apparent sizes greater than this are either 
not detected, or may be detected incorrectly as multiple 
close sources. Prior to catalog construction, all observa- 
tions are inspected visually for the presence of extended 
sources that may be detected incorrectly, and such obser- 
vations are excluded from catalog processing. For ACIS 
observations, if the presence of any spatially extended 
emission is restricted to a single CCD only, then the data 
from that CCD are dropped, and sources detected on any 
remaining CCDs are typically included in the catalog. 
The latter rule allows many sources surrounding bright, 
extended cores of galaxies to be included in the catalog, 
rather than having the entire observation rejected out- 
right. 

While the visual inspection and rejection process is 
inherently subjective in nature, an attempt was made 
to calibrate the method by constructing a "training 
set" of several hundred observations that were processed 
through a test version of the catalog pipelines. The train- 
ing set observations included a wide variety of point, 
compact, and extended sources, with differing exposures 
and SNR, which were classified as accept /reject based on 
the actual results of running the pipeline source detection 
and source property extraction steps. These observations 
and classifications were then used to train the personnel 
who performed the visual inspection process. 



3.2. Observation Recalibration 

Although all observations included in the CSC have 
been processed through the CXCDS standard data pro- 
cessing pipelines, we nevertheless re-run the instrument- 
specific calibration steps as the first step in catalog con- 
struction. One reason for reapplying the instrumental 
calibrations is that they are subject to continuous im- 
provements, and may have been revised since the last 
time the observations were processed or reprocessed. A 
second reason is to ensure that a single set of calibrations 
are applied to all datasets, so that the resulting catalog 
will be calibrated as homogeneously as possible. 

For ACIS, the principal instrument-specific calibra- 
tions that are re-applied are the (time-dependent) gain 
calibration and the correction for CCD charge transfer 
inefficiency (CTI). The former calibration maps the mea- 
sured pulse height for each detected X-ray event into 
a measurement of the energy of the corresponding inci- 
dent X-ray photon. CTI correction attempts to account 
for charge lost to traps in the CCD substrate when the 
charge is being read out. This effect is considerably larger 
than anticipated prior to launch because of damage to 
the ACIS CCDs caused by the spacecraft's radiation en- 
vironment. Additionally, observation-specific bad pixels 
and hot pixels are flagged for removal, as are "streak" 
events on CCD S4 (ACIS-8). The latter ap parently resul t 
from a flaw in the serial readout electronics (JHpuck 2000) . 
Pixel afterglow events, which arise because of energy de- 
posited into the CCD substrate by cosmic ray charged 
particles, are removed using the acisjrun_hotpix tool 
that is also included in CIAO. Although this program 
can miss some real faint afterglows, such events are very 
unlikely to exceed the flux significance threshold required 
for inclusion in the catalog. The default 0.5 pixel event 
position randomization in chip coordinates is used when 
the calibrations are reapplied. 

The main instrument specific calibrations for HRC 
data relate to the "degapping" correction that is applied 
to the raw X-ray event positions to compensate for dis- 
tortions introduced by the HRC detector readout hard- 
ware. Several additional calibrations compensate for ef- 
fects introduced by amplifier range switching and ringing 
in the HRC electronics, and a number of validity tests are 
performed to flag X-ray event positions that cannot be 
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Tool Name 



TABLE 5 
CSC-Related CIAO Tools 



CIAO Version 



Description 



aprates 
dmellipse 


1.1 
4.1 


eff2evt 


1.1 


lim_sens 


4.1 


mkpsfmap 

acis_streak_map 

dither_region 


4.1 
4.1.2 
4.1.2 


evalpos 
glvary 
pileup_map 
modelf lux 


4.1.2 
4.1.2 
4.1.2 
4.1.2 


sreextent 


4.1.2 


create_bkg_map 
dmimgpm 


4.2 
4.2 



Calculate source aperture photometry properties 

Calculate ellipse including specified encircled fraction 

Calculate energy flux from event energies 

Create a limiting sensitivity map 

Look up PSF size for each pixel in an image 

Create a high spatial frequency background map 

Calculate region on detector covered by a sky region 

Get image values at specified world coordinates 

Search for variability using Grcgory-Loredo algorithm 

Create image that gives indication of pilcup 

Calculate spectral model energy flux 

Compute source extent 

Create a background map from event data 

Create a low spatial frequency background map 



properly corrected due to amplifier saturation and other 
effects. 

Since data are recorded continuously during an ob- 
servation, a "Mission Time Line" is constructed during 
standard data processing that records the values of key 
spacecraft and instrument parameters as a function of 
time. These parameters are compared with a set of cri- 
teria that define acceptable values, and "Good Time In- 
tervals" (GTIs) that include scientifically valid data are 
computed for the observation. The GTI filter from stan- 
dard data processing is reapplied without change as part 
of the recalibration process. 

Background event screening performed as part of cata- 
log data recalibration is somewhat more aggressive than 
that performed as part of standard data processing, 
typically reducing the non- X-ray background. For a 
10 ks observation, the median catalog background rate 
is roughly 80% of the nomi nal field background rates 
^Chandra X-ray Centerll2009h . although there is consid- 
erable scatter. F. A. Primini et al. (2010, in prepara- 
tion) include a detailed statistical analysis of the im- 
provements to the non- X-ray background afforded by this 
screening. 

The reduction of the background event rate is achieved 
by removing time intervals containing strong background 
flares. These time intervals are identified separately for 
each chip. First, the background regions of the image 
are identified by constructing a histogram of the event 
data, determining the mean and standard deviations of 
the histogram values, and rejecting all pixels that have 
values more than 3 standard deviations above the mean. 
An optimally-binned light curve of the background pixels 
is then created using the Gregory-Loredo algorithm (see 
§ 3.12.1). Time bins for which the count rate exceeds 
10 x the minimum light curve value are identified. The 
corresponding intervals are considered to be background 
flares, and the GTIs are revised to exclude those periods. 

We emphasize that the objective of this procedure is to 
remove only the most intense background flares, which 
occur relatively infrequently. Time intervals that include 
moderately enhanced background rates are not rejected 
by this process, since their contributions increase the 
overall SNR. The aggregate loss of good exposure time 
exceeds 25% for less than 1.5% of the observations in- 
cluded in the catalog; the loss is greater than 10% for 



3% of the observations, and greater than 5% for 5% of 
the observations. 

For each observation included in the CSC, the recali- 
brated photon event list is archived, together with several 
additional full-field data products. These include multi- 
resolution exposure maps computed at the monochro- 
matic effective energies of each energy band and the as- 
sociated ancillary data products (aspect histogram, bad 
pixel map, and field of view region definition), used to 
construct them (see Table 3). 

3.3. Background Map Creation 

For the first release of the CSC, background maps are 
used for automated source detection. They are created 
directly from each individual observation with the nec- 
essary accuracy. The general observation background is 
assumed to vary smoothly with position, and is mod- 
eled using a single low spatial frequency component. Al- 
though this assumption is in general satisfied across the 
fields of view included in this catalog release, there may 
be localized regions where the background intensity has 
a strong spatial dependence, and therefore where the de- 
tectability of sources may be reduced. Several different 
approaches were considered for constructing the low spa- 
tial frequency background component, including spatial 
transforms, low pass filters, and data smoothing. How- 
ever, the most effective and physically meaningful tech- 
nique is a modified form of a Poisson mean. This method, 
described below, estimates the local background from the 
peak of the Poisson count distribution included in a de- 
fined sampling area. The dimensions of the sampling 
area act effectively as a spatial low pass filter that deter- 
mines the minimum angular size that contributes to the 
background. 

High spatial frequency linear features, commonly re- 
ferred to as "readout streaks," result when bright X-ray 
sources are observed with ACIS. These streaks arise from 
source photons that are detected during the CCD read- 
out frame transfer interval (~ 40 /is per row) following 
each exposure (~ 3.2 s per exposure for a typical obser- 
vation). All pixels along a given readout column are 
effectively exposed to all points on the sky that lie along 
that column during the frame transfer interval, so that 
columns including bright X-ray sources have enhanced 
count rates along their length. Unless accounted for by 
the source detection step, the increased counts in the 
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Fig. 11. — ACIS broad-band high and low spatial frequency background maps for observation 00735, as used for catalog source detection 
(Fig. 12, Right). Left: ACIS high spatial frequency background map component. Each image pixel represents 2x2 blocked sky pixels. 
Intensities have an an offset of +0.1 count (image pixel) - added, and the result is scaled logarithmically over the range 0.0375-3.75. The 
readout streak associated with the bright source is clearly visible. Right: ACIS low spatial frequency background map component. The 
Poisson mean includes a residual image of the bright source, at a peak level of ~ 0.15 count (image pixel) - . 



bright readout streak are detected as multiple sources. 
Although readout streaks are comprised of mis-located 
source photons, we choose to model them as a back- 
ground component. 

Background maps computed for ACIS observations in- 
clude contributions from both components, while HRC 
background maps include only the low spatial frequency 
component. 

The reader should note that background maps are not 
used when deriving source properties such as aperture 
photometry. Instead, a local background value deter- 
mined in an annular aperture surrounding the source is 
used, as described in § 3.4.1. Significant spatial vari- 
ations of the observed X-ray flux on the scale of the 
background aperture will increase the background local 
variance, thus reducing the significance of the source de- 
tection, perhaps below the threshold required for inclu- 
sion of the source in the catalog. This effect is seen in 
some galaxy cores, where the unresolved emission con- 
tributes X-ray flux to the annular background apertures 
surrounding each source. 

3.3.1. ACIS High Spatial Frequency Background 

The al gorithm described he r e is a r efinement of method 
used bv iMcCollough fc Rotsl (|2005|) to address the im- 
pact of readout streaks on source detection. The streak 
map is computed at single pixel resolution independently 
for each ACIS CCD and energy band. The first step 
is to identify the bright-source-free regions on the de- 
tector. For ease of computation the orientation of the 
X-axis is defined to be along the chip rows (perpendic- 
ular to the readout direction) and the Y-axis is defined 
to be along the direction of the readout columns. To 
identify the source-free regions, the photon event totals, 
X sum summed along the X-axis are constructed, and the 
median (X), mode (X), and standard deviation (ax) 



of the distribution of the X sum values are computed. 
These values provide a basic characterization of the back- 
ground. From an examination of many data histograms, 
the maximum value of X sum which can still considered 
background dominated is given by 

X sum (max) = minX + nax, 2 A], 

where n is set to 1. Rows for which X sum 3> X sum (max) 
include a substantial bright source contribution. All rows 
with A sum < X sum (max) (excluding off-chip and dither 
regions) are considered to comprise the source-free re- 
gions and are used to calculate the streak map. 

The average number of events per pixel is calculated 
separately for each readout column (Y-axis direction) 
from all of the rows in the source-free regions. These 
values are replicated across each CCD row to create an 
image that includes the sum of the readout streak con- 
tribution and the mean one-dimensional low spatial fre- 
quency background component. The latter must be ac- 
counted for when combining the high spatial frequency 
readout streak map with the two-dimensional low spatial 
frequency background map. 

For the algorithm to obtain a good measure of the 
background, of order 100 bright-source-free rows are re- 
quired. This condition is satisfied for most observations. 
Observations with too few source-free rows poorly sam- 
ple the background. This can lead to erroneously low in- 
tensities for bright readout streaks in the resulting back- 
ground map, which may enhance the false source rate 
along these streaks. Faint sources that fall in the source- 
free rows will be considered to be part of the background, 
which can lead to similar results. Nevertheless, the al- 
gorithm is remarkably effective, even in crowded regions 
such as the Orion complex and the Galactic center fields. 
An example broad-band ACIS streak map, created for 
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Fig. 12. — ACIS broad-band image of the central region of the field of observation 00735 (M81), which includes an extremely bright 
source that produces a very bright readout streak. Because photon pile-up has eroded the central peak of the bright source, the source is 
detected incorrectly as multiple close sources that must be rejected manually. Left: Numerous false sources are detected along the length 
of the readout streak if the latter is not modeled as part of the background. Source detections are shown in cyan. No quality assurance 
processing has been applied to these detections. Right: When the background map described in the text is used, the false sources are 
suppressed. Source detections in green are included in the catalog; sources in red do not meet the minimum flux significance criteria for 
inclusion in the catalog; sources in magenta have been rejected manually during quality assurance processing. 



observation 00735 (M81), is shown in Figure 11, Left. 



3.3.2. Low Spatial Frequency Background 

For each observation, a low spatial frequency back- 
ground map is constructed separately for each en- 
ergy band and image b locking factor (see § 3.4). 
iMcCollough fc Rotsl (|2008f ) provide an initial discussion 
of this algorithm and general background map creation. 

As described above, for ACIS observations the high 
spatial frequency background map includes a component 
that represents the one-dimensional average of the low 
frequency background over the rows used to create the 
streak map. This component, as well as the high spatial 
frequency background, are removed by subtracting the 
streak map from the original image from which it was 
created. For each image blocking factor, the difference 
image is constructed by subtracting the appropriately re- 
gridded streak map from the corresponding blocked orig- 
inal image. 

For each pixel in the resulting difference image, a cen- 
tered sampling region with dimensions n x n pixels is 
defined. Spatial scales smaller than ~ n pixels are atten- 
uated. The sampling regions are truncated at the edges 
of the images, and so some higher frequency information 
may propagate into the background map. However this 
effect has not been found to have any significant impact 
on the utility of the resulting map. 

A histogram of the count distribution is constructed 
from the pixels included in the sampling region asso- 
ciated with each image pixel. The first histogram bin 
will typically span the count range from —0.5 to +0.5 
for ACIS observations, since the readout streak map has 
been subtracted and there will be some negative pixels. 
The low spatial frequency background at this image pixel 
location is computed using a modified form of a Poisson 



mean 



mean [h (a) U h (b) U/i(c)], 



where h(x) is the number of counts in histogram bin x, 
a is the bin with the maximum number of histogram 
counts, and b and c are the lower and higher bins imme- 
diately adjacent to a. The low spatial frequency back- 
ground map is formed by computing bu for each pixel lo- 
cation in the image. For ACIS observations, n — 129 pix- 
els, corresponding to a spatial scale of order V for images 
blocked at single pixel resolution. Figure 11, Right dis- 
plays the ACIS broad-band low spatial frequency map for 
observation 00735 (M81) that corresponds to the streak 
map shown in the left hand panel of the figure. 

3.3.3. Total Background Map 

The first step in creating the total background map is 
to correct the readout streak map (for ACIS observations 
only) for the effects of reduced exposure near the edges 
of the observation that arise due to the spacecraft dither, 
by dividing by the appropriate band-specific normalized 
exposure map. Similarly, the low spatial frequency back- 
ground map is corrected by dividing by the smoothed, 
band-specific normalized exposure map. The smoothing 
that is applied to the normalized exposure map in the lat- 
ter case matches the smoothing applied when construct- 
ing the low spatial frequency background map. Finally, 
the two background components for each energy band 
are summed to produce the total exposure-normalized 
background map that is required for source detection. 

Figure 12 displays the central region of the broad-band 
ACIS image of M81 (observation 00735), with source de- 
tections overlayed. The source detections shown in the 
left-hand panel are those that result if the background is 
modeled internally by wavdetect (see § 3.4, below); the 
panel on the right shows the source detections resulting 
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from using the total background from Fig. 11. Using 
the background map has eliminated the false sources de- 
tected on the readout streak. 

The total background maps for each energy band are 
also archived and accessible through the catalog. These 
maps differ from those used for source detection in 
that they have been multiplied by the normalized band- 
specific exposure map, and arc therefore recorded in units 
of counts. For the convenience of the user, we also store 
multi-resolution photon-flux images for the full field of 
each observation, created by filtering the photon event 
list by energy band, binning to the appropriate image 
resolution, subtracting the total background map appro- 
priate to the energy band, and dividing by the corre- 
sponding exposure map. 

3.4. Source Detection 

Candidate sources for inclusion in the CSC are identi- 
fied using the CIA O wavdetect wavelet -based source de- 
tection algorithm (Fre eman et al.|[2002f ). wavdetect has 
been used succ essfully with Chandr a data by a number 
of authors (e.g ., Brandt ct al. 2 001t iGiaconni e t al. 2002; 
iLehmer et aTJ T2005: Ki m et al.ll2007HMuno et all 12009')! 
and its capabilities and limitations are well known (e.g., 
iValtchanov et~al|[200l . 

Early in the catalog processing pipeline develop- 
ment cycle, several different methods for detecting 
sources were evaluated. In addition to wavdetect, 
these include d the CIAO imple m entations of the 
slidin g cell (jHarnden et al.1 Il984t iCalderwood et all 
1200 ID and Voronoi t e ssella tion and percolation 
(jEbeling fc Wiedenmannl [1993) alg orithms, and a 
versio n of the SExtractor package ()Bertin fc Arnouti 
I1996T ) modified locally to use Poisson errors in the low 
count regime. 

The Voronoi tessellation and percolation algorithm was 
quickly discarded because of the significant computa- 
tional requirements and complexities for automated use. 
A series of simulations was used to compare the perfor- 
mance of the remaining methods with respect to source 
detection efficiency for isolated point sources, the effi- 
ciency with which close, equally-bright pairs of point 
sources with 2" and 4" separations are resolved, and false 
sourc e detection rate ( A. Dobrzycki, private communica- 
tion; lHain et all 120041 ). The first two properties were 
evaluated for point sources containing 10, 30, 100, and 
2000 counts, with off-axis angles 0-10' with 1' spacing, 
and nominal background rates for exposure times of 3, 
10, 30, and 100 ks. The false source rate was evaluated as 
a function of off-axis angle for the same exposure times. 

All three detection algorithms performed reliably for 
bright, isolated sources located close to the optical axis. 
Compared to the remaining methods, wavdetect had 
better source detection efficiency for faint sources located 
several arcminutes off-axis, and was able to resolve close 
pairs of sources more reliably than the sliding cell tech- 
nique. The locally modified version of SExtractor pro- 
vided inconsistent results, in some cases detecting large 
numbers of spurious sources. 

These simulations were performed early in the cata- 
log processing pipeline development cycle, as an aid in 
selecting the source detection algorithm to be used for 
catalog construction. They did not make use of the back- 
ground maps described in the previous section. The ac- 



tual performance of the source detection process used to 
construct the CSC is established from more detailed and 
robust simulations, as described in § 4 and references 
therein. 

Based on the results of the simulations, wavdetect was 
selected as the source detection method of choice for the 
CSC. 

The wavdetect algorithm does not require a uniform 
PSF over the field of view, and is effective in detect- 
ing compact sources in moderately crowded fields with 
variable exposure and Poisson background statistics. To 
detect candidate sources in a two-dimensional image D, 
wavdetect repeatedly constructs the two-dimensional 
correlation integral 

C{x, y;a)= / / dx' dy' W(x -x',y- y'; a) D(x' , y') 

(2) 
for a set of Marr ("Mexican Hat") wavelet functions, W, 
with scale sizes that are appropriate to the source di- 
mensions to be detected. The elliptical form of the Marr 
wavelet may be written in the dimensionless form 



Ty( a; ,y;a) = (2-p 2 )cxp(V/2), 



(3) 



where 



p 2 = -^(x cos cf> + y sin cf>) 2 -\ — =■(— x sin cf> + y cos <fi) 2 
a{ a% 

and the parameters a — (oi, 02, <fi) define the semi-major 
and semi-minor radii and rotation angle of the Mexican 
Hat. 

A localized clump of counts in the image D will pro- 
duce a local maximum of C if the scale sizes defined by 
a are approximately the same as, or larger than, the 
dimension of the clump. To determine whether a local 
maximum of C is due to the presence of a source, the 
detection significance, Si t j, in each image pixel (i,j) is 
determined from 



°i,j ~ 



dC p(C\rtB,i,j) 



J/- 



where nB,i,j is the number of background counts within 
the limited spatial extent of W, and p(C\riB,i,j) is the 
probability of C given the background B. If Sij < So, 
where So is a defined limiting significance level, then pixel 
(i,j) is identified as a source pixel. 

The limiting significance level used to generate the 
CSC is set to S = 2.5 x 10~ 7 . This formally corre- 
sponds to ~ 1 false source due to random fluctuations 
per 2048 x 2048 pixel image, although due to the heuris- 
tics of the algorithm, the actual number of false sources 
may be lower. The situation is further complicated in 
our case because the final candidate source list output 
from the CSC source detection pipeline is a combination 
of several wavdetect runs in different energy bands (see 
below). We note that reliable quantitative estimates of 
the false source rates and detection efficiency can only 
be provided through simulations, as discussed in § 4. As 
described in § 2.5, we impose an additional restriction 
on the flux significance of a source. To ensure that the 
flux significance requirement is the defining criterion for a 
source to be included in the catalog, we have verified that 
the flux significances of sources that pass our wavdetect 
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Fig. 13. — Top: Estimated flux significance versus catalog flux significance for ~ 11, 000 sources detected in the ACIS broad energy band 
in a pre-release test version of the CSC. The "estimated" flux significance is defined as the ratio net_counts/net_counts_err, as reported by 
wavdetect, and correlates well with the actual flux significance used the determine catalog inclusion. Horizontal lines indicate the median of 
the points in each bin, and the vertical lines identify the extreme points. Boxes include 90% of the points in each bin. Bottom: Distribution 
of estimated flux significances for all detected sources (solid line), including those which fell below the flux significance threshold for the 
test catalog. The distribution of estimated significances for sources included in the catalog is shown by the dotted line; the dashed line 
is the distribution of actual flux significances for the same sources. The flux significances for all detected sources extends well below the 
distributions for sources included in the catalog. 



threshold extend well below that required to satisfy the 
flux significance rule (see Figure f3). We estimate that 
roughly ~ 1/3 of all the sources detected by wavdetect 
fall below this threshold. 

Source detection is performed recursively by applying 
wavdetect to multiply-blocked sky images constructed 
as described in § 2.5.3. The use of a constant blocked 
image size maintains algorithm efficiency while not com- 
promising detection efficiency in the outer areas of the 
field of view where the PSF size is significantly larger 
than a single pixel. 

Applied to the CSC, wavelets with scales a, = 1, 2, 4, 
8, and 16 (blocked) pixels are computed for each image 
blocking factor and each energy band except for the ACIS 
ultra-soft band. This combination of wavelet scales and 
image blocking factors provides good sensitivity for de- 
tection of sources with observed angular extents < 30". 
Some point sources with extreme off-axis angles, 6 > 20', 
may not be detected because the size of the local PSF ex- 
ceeds the largest wavelet scale/blocking factor combina- 
tion. F. A. Primini et al. (2010, in preparation) calibrate 
this effect statistically. 

Source detection is not performed in the ACIS ultra- 
soft energy band. This band is impacted heavily both 
by increased background and by decreased effective area 
because of ACIS focal plane contamination (the ratio of 
integrated background to effective area is 1-2 orders of 
magnitude larger for the u band when compared to the 
other ACIS energy bands). Under these circumstances 
we are limited by the accuracy of the background map 
determination; small errors in the background map result 



in an unacceptable fraction of spurious source detections. 

The wavdetect algorithm incorporates steps to com- 
pare nearby correlation maxima identified at multiple 
wavelet scales to ensure that each source is counted only 
once. After duplicates are eliminated, a source cell that 
includes the pixels containing the majority of the source 
flux is constructed. Although a source cell may have an 
arbitrary shape, for simplicity an elliptical representation 
of the source region is used throughout the CSC. The 
lengths of the semi-axes of this source region ellipse are 
set equal to the 3 a orthogonal deviations of the distri- 
bution of the counts in the source cell. 

Source region ellipses for candidate sources detected 
within a single observation from images with different 
blocking factors or in different energy bands are com- 
bined outside of wavdetect to produce a single merged 
source list. This step rejects any detections that have 
RMS radii smaller than the 50% enclosed counts fraction 
radius of the local PSF, calculated at the monochromatic 
effective energy of the band in which the source is de- 
tected. Such detections are likely artifacts arising from 
cosmic ray impacts. Candidate source detections whose 
centroids are closer than the local PSF radius, or that 
are closer than 3/4 of the mean detected source ellipse 
radii, are deemed to be duplicates. If any duplicates 
are identified, then the detection from the image with 
the smallest blocking factor is kept, and if the image 
blocking factors are equal, then the detection with the 
highest significance is used. This approach ensures that 
data from the highest spatial resolution blocked image 
will be used to detect point and compact sources. How- 
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Fig. 14. — Histogram of detected source PSF fractions. For 
sources with off-axis angles 8 < 10', the PSF fraction included 
in the source region aperture is shown by the solid line, while the 
dashed line displays the PSF fraction included in the background 
region aperture. For sources with 8 > 10', the dotted line repre- 
sents the PSF fraction included in the source region aperture and 
the dash-dotted line indicates the PSF fraction included in the 
background region aperture. 

ever, knotty emission that is located on top of extended 
structures will tend to be identified as distinct compact 
sources, while the extended emission is not recorded. 

3.4.1. Source Apertures 

Numerous source-specific catalog properties are evalu- 
ated within defined apertures. We define the "PSF 90% 
ECF (enclosed counts fraction) aperture" for each source 
to be the ellipse that encloses 90% of the total counts in 
a model PSF centered on the source position. Because 
the size of the PSF is energy dependent, the dimensions 
of the PSF 90% ECF aperture vary with energy band. 

We define the "source region aperture" for each source 
to be equal to the corresponding 3 a source region ellipse 
included in the merged source list, scaled by a factor of 
1.5x. Like the PSF 90% ECF aperture, the source re- 
gion aperture is also centered on the source position, but 
the dimensions of the aperture are independent of energy 
band. Evaluation of model PSFs with off-axis angles 
< 10' demonstrates that the dimensions of the source 
region aperture correspond approximately to the dimen- 
sions of the PSF 90% ECF ellipses for the ACIS broad 
energy band. This is confirmed a posteriori by examining 
the distribution of PSF aperture fractions in source and 
background (see below) region apertures of all individual 
catalog sources with ACIS broad band flux significance 
> 3.0. Figure 14 demonstrates that the source region 
apertures typically include ~ 90-95% of the PSF, while 
the background region apertures contain < 5-10%. We 
emphasize that while these fractions are typical, the ac- 
tual PSF fractions, determined by integrating the model 
PSF over the source and background region apertures 
and excluding regions from contaminating sources, are 
used for the actual determination of source fluxes (see 
§ 3.7). 

Comparison of the source fluxes within the PSF 90% 
ECF aperture and the source region aperture provide a 
crude indication whether a source is extended. If the flux 
in the source region aperture is significantly greater than 
the flux in the PSF 90% ECF aperture, then the source 



region determined by wavdetect is considerably larger 
than the local PSF, and the source is likely extended. 

Both the PSF 90% ECF aperture and the source region 
aperture are surrounded by corresponding background 
region annular apertures. In both cases, the inner edge 
of the annulus is set equal to the outer edge of the corre- 
sponding source aperture, while the radius of the outer 
edge of the annulus is set equal to 5 x the inner radius 
of the source region aperture. Although the background 
region apertures defined in this manner include ~5-10% 
of the X-rays from the source, this contamination is ac- 
counted for explicitly when computing aperture photom- 
etry fluxes. 

Overlapping sources could contaminate any measure- 
ments obtained through the source and background aper- 
tures. To avoid this, both types of apertures are modi- 
fied to exclude areas that are included in any overlap- 
ping source region apertures, or that fall off the de- 
tector. Areas surrounding ACIS readout streaks are 
also excluded from the modified background apertures. 
Aperture- specific catalog quantities are derived from the 
event data in the appropriate modified aperture. The 
fractions of the local model PSF counts that are included 
in the modified apertures are recorded in the catalog for 
each source, and are used to apply aperture corrections 
when computing fluxes, under the assumption that the 
source is well modeled by the PSF. 

The modified source region and background region 
aperture definitions are record ed as FITS files using th e 
spatial region file convention ((Rots & McDowell 2008). 
CIAO (|Fruscione et al.ll2006[ ) can be used to apply these 
regions as spatial filters to extract the photon event data 
for the source (or background) from the archived photon 
event list. To simplify access to file-based data products 
(see Table 3) for individual sources, we also separately 
store the source region photon event list, per-band ex- 
posure maps, and per-band source region images. These 
products include data from the rectangular region of the 
sky that is oriented North-South/East-West and that 
bounds the background region. 

3.4.2. Matching Source Detections from Multiple 
Observations 

Each source record in the CSC Master Sources Ta- 
ble is constructed by combining source detections in- 
cluded in the Source Observations Table from one or 
more observations. A necessary first step in this pro- 
cess requires matching the source detections from all of 
the observations that include the same region of the sky . 
Cross-matching algorithms fe.g.. iDevereux et al.l 120051 : 
iGrav et"al]|2006f ) are often focussed on efficiently match- 
ing large catalogs, and typically use criteria on the posi- 
tion difference distribution, or cross-correlation approach 
techniques, for identifying matches. In many cases, these 
approaches assume (often implicitly) that the source PSF 
is at least approximately spatially uniform across the 
field of view, and comparable between the datascts being 
matched. 

However, when matching source detections across mul- 
tiple Chandra observations, the strong dependence of 
the PSF size with off-axis angle must be considered ex- 
plicitly, since source detections that are well off-axis in 
one observation are often resolved into multiple sources 
close to the optical axis in other observations. Under 
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Fig. 15. — Left: Upper and lower images illustrate the common source matching case where the source detections from the individual 
observations all uniquely match a single source on the sky. The source region aperture determined from the upper image is shown in cyan, 
while the source region aperture determined from the lower image is shown in red. Center: In this case, the off-axis source region aperture 
computed from the source detection in the upper image, shown in cyan, overlaps multiple source region apertures from the observation in 
the lower image, shown in red. The cyan source detection is confused, and will be connected to the master sources associated with the 
red source detections using "ambiguous" linkages. Right: The sources detected in these observations for a confused "pair of pairs." The 
fractional overlaps between the pair of cyan source region apertures and the pair of red source region apertures is sufficiently large that 
these detections are assigned to be resolved by human review. 



these circumstances the source positions determined by 
wavdetect are not comparable, and cannot be used for 
source matching. Instead, the source matching approach 
used for the CSC is based on the overlaps between the 
PSF 90% ECF apertures of the source detections from 
the individual observations. Although empirical in na- 
ture, this algorithm works well for matching compact 
source detections between Chandra observations. 

The detailed algorithm is described in Appendix A. 
The method identifies the overlap fractions between the 
PSF 90% ECF apertures of overlapping source detections 
from the observations, and separates them into three dif- 
ferent categories. 

The first category is the simplest, where the source 
detections from the various observations have apertures 
that all mutually overlap (Fig. 15, Left). This is the most 
common situation, and corresponds to the case where 
the source detections all uniquely match a single source 
on the sky. Roughly 90% of the ~ 18, 000 sources in 
the Master Sources Table that are linked to more than 
one source detection in the Source Observations Table 
fall into this category. Each of the matching entries in 
the latter table will be associated with the corresponding 
Master Sources Table entry with a "unique" linkage, as 
described in § 2.3. 

In the second category, the aperture associated with 
a source detection in one observation overlaps the aper- 
tures associated with multiple distinct source detections 
from other observations. This circumstance typically 



arises because source detections from a single observation 
are always assumed to be distinct; this assumption can 
fail very far off-axis (9 > 20'), where the PSF size exceeds 
the maximum wavdetect wavelet scale/image blocking 
factor combination. This category is illustrated in Fig- 
ure 15, Center, and arises most often because a source 
detection in one observation is resolved into multiple 
sources by one or more of the overlapping observations. 
The unresolved source detection in the Source Observa- 
tions Table will be connected to all Master Sources Table 
entries associated with the matching resolved source de- 
tections via "ambiguous" linkages, and the detection will 
be flagged as confused. The X-ray photon events associ- 
ated with the unresolved detection cannot be distributed 
across the matching resolved sources. In release 1 of the 
CSC, source properties derived from the detection will 
not be used to compute the source properties included 
in the Master Sources Table. Upper limits for photomet- 
ric quantities could in principle be extracted from the 
unresolved source detection, and these would be quite 
valuable for variability studies. This capability will be 
included in a future release of the CSC. 

In a few cases, a set of aperture overlaps cannot be re- 
solved automatically using the current algorithm. This 
third category typically occurs when there are multiple 
overlapping, confused source detections. In this case, the 
source detections are flagged for review by a human, who 
is then responsible for resolving the matches. Only 415 
(out of 94,676) master sources include source detections 
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that required manual review, and the majority of these 
were readily resolved as confused "pairs of pairs." The 
latter case, which is illustrated in Figure 15, Right com- 
monly occurs because a pair of source detections included 
in a single observation both overlap a pair of source de- 
tections in another observation. Each of the source de- 
tections in the first observation overlaps both source de- 
tections in the second observation, making the detections 
confused, and vice- versa. If a manual review is required 
to complete a match for a specific source, then a flag is 
set in the catalog to indicate this fact. 

The actual processing required to perform the matches 
is complex. Whenever new overlapping source detec- 
tions are identified during catalog processing, the set of 
matches is recomputed using all of the observations pro- 
cessed so far. The algorithm then queries the prior state 
of the catalog, and determines the set of updates that 
are necessary to migrate from that state to the newly 
determined state. This procedure works regardless of 
the order in which observations processed, and also per- 
mits an already-processed observation to be reprocessed 
should an error have occurred. 

3.4.3. Source Naming 

Each distinct X-ray source included in the Master 
Sources Table is assigned a name that is derived from 
the source's location on the sky. Catalog sources are 
designated "CXO JHHMMSS.s ± DDMMSS '," where 
HHMMSS.s and ±DDMMSS are the ICRS right ascen- 
sion and declination, respectively, of the source position, 
truncated to the indicated precision. This format com- 
plies with the International Astronomical Union (IAU) 
Recommendations for Nomenclature!]. The "CXO" pre- 
fix is registered with Commission 5 of the IAU for ex- 
clusive use in source designations issued by the Chandra 
X-ray Center. 

The name assigned to a source is determined from 
the combined source position once the detections of the 
source are merged according to § 3.4.2. If the source has 
never been included in a released version of the CSC, 
then the source name may be revised if a source detec- 
tion in a subsequently processed observation modifies the 
combined source position. Therefore, the name assigned 
to a source that is visible in a database view may change 
as new observations are processed, if the source has never 
been included in a catalog release. 

Once a source is included in a released version of 
the CSC, then the name of that source is frozen. The 
name will not be changed in cither future catalog re- 
lease views (i.e., subsequent catalog releases) or database 
views, even if additional observations refine the source 
position. Therefore minor discrepancies can arise be- 
tween the latter and the source designation. However, 
if an observation included in a later catalog release re- 
solves an apparently single source included in an earlier 
version of the catalog into multiple distinct sources, then 
the previous source designation is retired and new names 
are assigned to the resolved sources. 

3.5. Source Position Determination 

Within a single observation, the detected source po- 
sitions are those assigned by the wavdetect algorithm. 
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Fig. 16. — Mean (signed) coordinate differences between SDSS 
and CSC source positions measured along the £ vector (defined 
in the text) as a function of off-axis angle, 9, for individual ob- 
servations of CSC sources with at least 500 net counts are shown 
as filled circles. Open symbols indicate expected values computed 
from high SNR ray-trace simulations at the indicated values of 9 
and <p. Note that the measured mean coordinate differences are 
consistent with zero offset for 9 < 8', and are a factor ~ 2 times 
smaller than the model predictions for larger values of 9. 



Their accuracy can be estimated by evaluating the mean 
(signed) coordinate differences between the wavdetect 
positions and the positions of matching sources in the 
seventh data r elease (DR7) of the S loan Digital Sky 
Survey (SDSS; lAbazaiian et all 12009ft . The latter are 
extracted from the CSC/SDSS Cross-Match Catalog. 
As shown in Figure 16, the mean coordinate differences 
demonstrate good position agreement between the CSC 
and SDSS for sources with 8 < 8' (where we have re- 
stricted the comparison to include only individual obser- 
vations of CSC sources with at least 500 net counts to 
minimize statistical errors). 

For larger off-axis angles, Figure 16 suggests that there 
may be a systematic offset between the wavdetect source 
positions and the SDSS source positions. The measured 
mean position difference is < 0'.'3 for 6 < 15', but appears 
to increase with off-axis angle. The ex act cause of this 
offset is uncertain. Some authors (e.g.. lAlexander et all 
l2003tlLehmer et alJl2005HLuo et al.ll2008ft have reported 
a similar effect, which they attribute to centroiding errors 
introduced by of the asymmetric nature of the Chandra 
PSF at large off-axis angles. 

High quality PSF simulations generated using 
the SAOTrace ( f ormer ly SAOsac) ray-trace code 
(jJerius et all [1995, 2004) confirm that the asymmetry 
can displace the measured centroid from the requested 
location of the simulated PSF on a uniform pixel grid. 
We designate the vector orientation from the measured 
centroid position to the requested location of the PSF 
as £. As shown in Figure 16, the expected centroid dis- 
placement along £ computed from the simulations is not 
a good measure of the actual mean coordinate difference, 
which is a factor of order 2 times smaller than predicted 
by the models. One possible reason for the disagreement 
between the model and actual measurements is that the 
Chandra plate scale calibration was derived from obser- 



7 http : //edsweb .u-strasbg . f r/cgi-bin/Dic/iau- spec . htx 



3 http : //exc . cf a . harvard . edu/cgi-gen/cda/CSC-SDSSxmatch . html 
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vations of NGC 2516 and LMC X-l using source centroid 
measureme nts that were not co rrected for the asymmetry 
of the PSF (|Markevitch|[200lbl) . 

As described in § 2.5.3, the celestial coordinates of a 
source observed by Chandra are computed by applying 
a series of transforms to the measured position of the 
source on the detector. The final step in this process 
applies the measured plate scale calibration to the differ- 
ence between the position of the source on the virtual sky 
pixel plane and a known fiducial point, which is typically 
the telescope optical axis. If all of the star-star baselines 
used to calibrate the plate scale were oriented parallel 
to £, then the lack of correction for the PSF asymmetry 
would to first order compensate for the linear compo- 
nent of the systematic position offset when measuring 
real sources whose locations are fixed in world coordi- 
nates rather than virtual pixel plane coordinates. Since 
not all star-star baselines were so aligned, some residual 
systematic position offset may be expected, but at an 
undetermined level that is less than predicted from the 
PSF simulations. 

Systematic position offsets can also arise from uncer- 
tainties in the detector geometry. As an example, con- 
sider imaging observations that use the nominal ACIS-I 
aimpoint. Sources with 6> < 8' will be located on the 
same CCD array as the aimpoint. As 9 increases, an in- 
creasing fractions of sources will instead be positioned on 
ACIS-S array CCDs, until for 9 > 11' all sources will be 
located on the ACIS-S array. Uncertainties in the rela- 
tive positions and tilts of the ACIS-I and ACIS-S arrays 
would therefore introduce systematic position offsets for 
sources with large off-axis angles, while not impacting 
sources that fall on the same CCD array as the aim- 
point. This signature is consistent with the absence of 
mean position differences measured for 9 < 8'. 

Because of the small magnitude of the mean position 
differences measured for 9 < 15', we have chosen not 
to apply an uncertain correction to source positions in 
release 1 of the catalog. We plan to investigate in detail 
the cause of the systematic position offsets at large off- 
axis angles, and adjust source positions in future catalog 
releases if appropriate. 

3.5.1. Source Position Uncertainty 

In addition to reporting measured source positions, 
wavdetect also reports positional errors associated with 
each source detection. The reported errors are based 
on a statistical moments analysis, and do not consider 
instrumental effects such as pixelization, aspect-induced 
blur, or asymmetrical PSF structure that may contribute 
to the total positional uncertainties. Simulations that 
compare the reference positions of artificially generated 
sources with their positions determined by wavdetect 
indicate that the positional uncertainties computed by 
wavdetect arc underestimated for sources with large 
off-axis angles. The simulation results, which quantify 
the dependence of positional uncertainties of simulated 
sources on off-axis angle, were found to be consistent 
with the more extensive simulations uses to construct 
the Chandra Multiw avelength Projec t (ChaMP) X-ray 
point source catalog ([Kim et al.ll2007l) . 

In the first release of the CSC, source position error 
ellipses are substituted by error circles computed using 



the ChaMP positional uncertainty relations 



logP = 



0.114561 - 0.4957 log S w + 0.1932 

0.0000 < log S w < 2.1393 

0.09686* - 0.2064 log S w - 0.4260 

2.1393 < log S w < 3.3000. 



(4) 

In these equations, P is the positional uncertainty in arc- 
seconds, 9 is the off-axis angle in arcminutes, and S w is 
the source net counts reported by wavdetect. These re- 
lations were derived to characterize the positional uncer- 
tainties at the 95% confidence level of X-ray point sources 
in the ChaMP X-ray point source catalog, which includes 
~6, 800 X-ray sources detected in 149 Chandra observa- 
tions. The values of log P computed using equations (01 
are not equal at the boundary where log 5^ = 2.1393 
(roughly 138 net counts). However, this error is negligi- 
ble for 9 <10'. 

Although HRC observations are not included in the 
first release of the CSC, we have used a series of simu- 
lations to derive an improved positional uncertainty re- 
lation that is appropriate for sources detected in HRC- 
I observations. The simulations include ~ 6, 000 point 
sources spanning < 9 < 22' and 9 < S w < 3600. The 
best fit surface for the 95% position uncertainty quantile 
is 

log P = 0.752569 + 0.2169856" + 0.0002426> 2 



- 1.142476 log S w 
-0.0405496* log S u 



0.172132 log 2 S w (5) 



The simulations do not sample the region with 9 > 20' 
and S w <^ 50, and so we impose an upper bound of 
logP = 2.128393 on this relation, to cap the uncertainty 
in this regime. 

In release 1.1 of the CSC, positional uncertainties for 
sources detected in ACIS observations are computed us- 
ing equation (@| while positional uncertainties for sources 
detected in HRC-I observations are computed using 
equation ([5]). 

The positional uncertainties from equations (j4|) and ([5]) 
provide a good measure of the statistical uncertainty of 
the location of the source in the frame of the observa- 
tion, but do not consider potential sources of error that 
are external to the observation. These include the error 
in the mean aspect solution for the observation, the as- 
trometric errors in the AXAF (Chandra) Guide and Ac- 
quisition Star Catalog (jSchmidt fc Greenl 2003) . and the 
calibration of the geometry o f the s pacecr aft and focal 
plane. As described in § 4.3, iRotsI (|2009f) has recently 
used the CSC/SDSS Cross-Match Catalog to calibrate 
the combined external error by analyzing the statistical 
distribution of the measured separations of CSC point 
source detections from indiv idual observations with their 
counterparts in SDSS DR7 (jAbazaiian et alj|2009[) . 

The resulting external astrometric error is 0'.'16±0'.'0.01 
(la-), or 0'.'39 (95% confidence). The latter must be 
added in quadrature to the position uncertainties from 
equations (j4j and <j5j) to compute the absolute position 
error for CSC sources. In release 1 of the catalog, the 
positional error reported in the catalog tables is taken 
directly from equation (@| , so the quadrature addition of 
the external astrometric error component must be per- 
formed by the user. Release 1.1 of the CSC will include 
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Fig. 17. — Histograms of the total absolute position error for 
source detections included in release 1 of the CSC. The dashed 
and dotted lines include all source detections with off-axis angles 
8 < 8' and 8' < 8 < 15', respectively. Each histogram is normal- 
ized by the total number of source detections within the relevant 
8 range. Source detections with at least 500 net counts are shown 
with the solid line (8 < 8') and dash-dotted line (8' < 8 < 15'). 
For the latter two histograms, the normalized source fraction has 
been scaled by a factor of 4 X . 



this error component directly in the tabulated values. 

A post facto histogram of the 95% confidence posi- 
tional uncertainties, including the external astrometric 
error, for source detections included in release 1 of the 
CSC is presented in Figure 17. The figure demonstrates 
that statistical errors due to lack of net source counts 
does dominate the positional uncertainty for all but the 
brightest sources. However, for bright sources detected 
at small off-axis angles, the external astrometric error 
limits the accuracy of the derived source positions. Any 
error introduced by using uncorrected source centroid po- 
sitions from wavdetect is negligible for the overwhelming 
majority of source detections. 

3.5.2. Combining Source Positions from Multiple 
Observations 

Improved estimates of the position and positional un- 
certainty of each X-ray source are determined from the 
statistically independent source detections included in 
the set of individual observations using a multivariate 
optimal weighting formalism. This technique is effec- 
tive in cases where simple averaging fails, for example 
when the area defining the source position varies signif- 
icantly from observation to observation. We express the 
uncertainties of the estimates in the form of error el- 
lipses centered upon the estimated source positions. An 
equivalent approach has been used for weapons targeting 
(|Orechoves kv 1996). To our knowledge the usage here is 
the first documented application to astrophysical data. 

In the multivariate optimal weighting formalism, given 
a set of estimates, Xj, of the mean of some two- 
dimensional quantity, and the 2x2 covariance matri- 
ces, of, associated with these estimates, an improved 
estimate, X, of the mean, and the associated covariance 
matrix, a 2 , are (e.g.. lDavisll2007bf ) 



X = (T 



2V 



x, 



E 



1 



For the application described here, we take Xi to be 
the ith estimate of the source position, projected onto 
a common tangent plane (which is constructed at the 
mean position of the ellipse centers) . The corresponding 
covariance matrix is 



af i cos 2 di + a'^i sin 2 di (a^ — a'^ t ) cos $i sin #j 

( a 2i ~ a li) C0S ^i Sm $» a \i Sm2 &i + a 2i COs2 ^i 

where a[ i and a' 2 j are the lengths of the semi-minor 
and semi-major axes, respectively, of the ith error ellipse 
projected onto the common tangent plane, and $i is the 
angle that the major axis of the zth error ellipse makes 
with respect to the tangent plane y axis. The derivation 
of equation ([7]) is presented in Appendix B. 

Once the covariance matrices corresponding to the er- 
ror ellipses for each individual source observation, equa- 
tion |7)l. are computed, the error ellipses are combined 
using equation ([6|). This yields the optimally weighted 
source position and position error ellipse for the com- 
bined set of observations, on the common tangent plane. 
Mapping these back to the celestial sphere provides the 
combined source position and error ellipse estimates. 

3.6. Source Extent Estimates 

The observed spatial extent of a source is estimated 
using a rotated elliptical Gaussian parameterization of 
the form 



S(x,y;a) 



so 

CT1CT2 



exp [— 7r(.4x) 2 ] , 



(8) 



where 



--<f"-. 



cos ipo sm <po 
- sin 4>q cos 4>q 



(6) 



where (x, y) is the Cartesian center location of the Gaus- 
sian, and the parameters a = (<7i, o~2, 4>o) are t ne 1 a radii 
along the major and minor ellipse axes, and the position 
angle of the major axis of the ellipse, respectively. 

The parameters of S are determined using a wavelet- 
based approach that is similar to that used for source 
detection. The methods differ in their details and as- 
sumptions, however. 

For source detection, the choice of wavelet scales used 
by wavdetect to detect sources is determined a priori. 
Because of the strong variation of PSF size with off-axis 
angle, multiple wavelet scales and input image blocking 
factors are required to search for sources at all off-axis an- 
gles, as described in § 3.4. Stepping between the discrete 
wavelet scales and image blocking factors as a function of 
off-axis angle introduces small but systematic biases in 
the derived dimensions of the source region ellipses (and 
therefore the source region apertures; see Figure 18). 

Photometric, spectral, and temporal properties deter- 
mined from the X-ray events included in the aperture 
are not impacted by these effects, since the aperture di- 
mensions are sufficiently large that they typically enclose 
^90% of the PSF counts, and a correction factor is ap- 
plied for the fraction of the PSF that falls outside of the 
aperture. However, these biases render the source region 
aperture dimensions unsuitable for use as a measure of 
the source extent. 
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Fig. 18. — Comparison of source region aperture dimensions with raw (undeconvolved) source extent for the ACIS broad (b) energy band 
for all source observations included in the first release of the CSC. Left: Semi-major axis of the source region aperture is plotted for each 
source observation versus source off-axis angle, 8. The step-like structure visible in the plot results from the stepping between discrete 
wavelet scales and image blocking factors as a function of off-axis angle. Right: Semi-major axis of the raw (undeconvolved) ACIS broad 
(6) energy band source extent is plotted versus 8. The source extent is derived using a scaleless wavelet approach, as described in the text, 
and varies smoothly with 8. Differing geometric tilts of the ACIS-I and ACIS-S CCD arrays relative to the focal plane split the distribution 
into separate populations for 3' < 8 < 7'. The lower population corresponds to point sources located on ACIS-I array CCDs, while point 
sources located on ACIS-S array CCDs comprise the upper population. The difference in the vertical scales between the two plot panels 
results because the source region aperture scales approximately as the PSF 90% ECF, whereas the source extent estimates the 1 a scale of 
an elliptical Gaussian parameterization of the source. 



The wavelet-based approach used for estimating the 
source extent determines the optimal wavelet scale size 
directly from the data under the assumption that a 
source exists at approximately the location determined 
by wavdetect. The raw source extent estimates derived 
using this approach vary smoothly with off-axis angle 
(Fig. 18). 

The two-dimensional correlation integral 

C(x,y;a) = 

J_ x J_ Y dx' dy' W(x -x',y- y'; a) S(x', y'; ex), 

(9) 
where the region of interest is \x'\ < X and \y'\ < Y, and 
W is again specified by equation Q, is computed first. 

We choose a coordinate system in which the peak of 
S is centered at the origin. The quantity tp(x,y;a) = 
C(x,y;ct)/(aia2) 1 ' 2 has a maximum value at the origin 
when di — <7iVS and (/) — (f>o ()Damiani et al.l H997). The 
source parameters are determined by maximizing if). 

In practice, equation (J9j) is evaluated as a discrete sum 
over the pixels of the image. Although integration of 
equation @ does not yield a simple closed-form solution, 
and numerical integration is computationally expensive, 
for the purpose of optimizing V>o = C(0, 0; a)/{a\a2) 1 ^ 2 , 
a rectangular approximation for the integral over each 
pixel is sufficient. In this approximation, 

W mn (xi,yj;oc) w W(x m - Xi, y n - yj-,oc)AxAy, 

where W(x,y;a) is evaluated at the center of each pixel 
and the pixel area is Ax Ay. 

A small sub-image of the source is extracted centered 
on the source position determined by wavdetect. The 
accuracy of this source position is refined by searching 
the center of the sub-image for the coordinates (2:0,2/0) 
that maximize ipo(x, y, a, a, 0). A new sub- image is then 
extracted using the improved source position. 

Finally, the size and orientation of the elliptical 
Gaussian source parameterization are derived by max- 
imizing ip(xo,yo',oc). Choosing good initial values for 



G&j helps to ensure that this optimization step con- 
verges reliably. We set the initial values a\ = a 2 — 
max [(didz) 1 ' 2 , Ogrid]) where di are the ellipse semi-axes 
derived by wavdetect, and the value of a gr id is obtained 
by examining V>o (#0 > J/0 i a i Q-> 0) on a grid of a values span- 
ning the half-width of the source image; a gr id is usually 
the smallest a that corresponds to cither a local maxi- 
mum or an inflection point. This choice is motivated by 
the observation that when a single source is present, the 
location of a local maximum provides a good estimate 
of the source size (see Figure 19). Similarly, when the 
source of interest is blended with other nearby sources, 
an inflection point where d^ipo = often occurs near 
the "edge" of the central source. When the first occur- 
rence of <9 a V>o = occurs at a local minimum, a gr id is the 
smallest value of a on the pixel grid. 

3.6.1. PSF Extent 

The spatial extent of the local PSF is determined for 
comparison with the observed source extent, and as an 
aid to assessing the intrinsic extent of the source. Since 
the size of the Chandra PSF is a strong function of off- 
axis angle, a ray-trace model is constructed at the mea- 
sured off-axis and azimuthal angles (9, <p) separately for 
each detected source. Although the shape of the PSF is 
energy-dependent, within each energy band the ray-trace 
model is computed only at the monochromatic effective 
energy (see § 2.5.2) of the band. This approximation 
results in an error that is dependent on the actual spec- 
trum of the source, but that does not exceed ~ 10% for 
typical power-law or black-body source spectra. 

The ray-trace model is computed u sing version 1.0.0 o f 
the SAOTrace simulation code (|Jerius et al.lll995l 12004 ) 
with the latest HRMA optical coefficient^ derived from 
calibration observations. A ray density of 0.2 rays/ mm 2 
is used for the ray-trace. The rays are then projected 
onto the detector focal plane, a Gaussian blur is applied 

9 The orbit_XRCF+tilts+ol_01b calibration model configura- 
tion. 
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Fig. 19. — Deriving a correlation scale length from the shape 
of ipo( a )- Three curves are shown, schematically illustrating the 
different shapes that ipo{a) can have. An unblended source nor- 
mally yields a distinct local maximum (point 1). Depending on 
the source spacing, closely spaced sources may yield an inflection 
point (point 2) or a local minimum (point 3). The text describes 
how each of these cases is treated. 

to account for the degradation due to de-dithering the 
telescope motion (a = 0"148 for ACIS, 0'.'2 for HRC), 
and the image is resampled onto the pixel plane of the 
detector. Each PSF image is recorded in the CSC as 
a file-based data product (see Table 3), which can be 
retrieved by the catalog user and compared directly to 
the corresponding source region image. 

The ray density used in the ray-trace models typically 
produces a total of ~6-8 x 10 3 counts (with a full range 
of ~ 4-13 x 10 3 counts) in the resulting PSF model in the 
ACIS broad, hard, and soft energy bands, where the com- 
bined HRMA/instrument effective area at the monochro- 
matic effective energies of the bands is ~ 300-400 cm 2 . 
The number of counts in the ACIS medium energy band 
PSF models is about 60% higher, because of the higher 
effective area at that band's monochromatic effective en- 
ergy. For the ACIS ultra-soft energy band the total PSF 
counts may be only a few hundred because of the poor 
quantum efficiency in the band. Although the PSF mod- 
els computed here are sufficient for their intended pur- 
poses of providing basic estimates of source extent and 
point source aperture corrections for aperture photom- 
etry, they are not suitable for analyses such as image 
deconvolution that require detailed PSF models. 

Once the image of the local model PSF is con- 
structed, the rotated elliptical Gaussian parameteri- 
zation, p(x,y;bi,b2,ip), of the spatial extent is com- 
puted in the same way that the observed source extent, 
S(x, y; <7i, (72, </>o), is computed from the source image. 

3.6.2. Intrinsic Source Extent 

Using the rotated elliptical Gaussian parameteri- 
zations derived above, the observed source extent, 
S(x,y;ai,<j2,4>o), can be treated as the convolution of 
the local PSF, p(x,y;bi,b2,i/j) with the intrinsic source 
extent, s(x, y',ai,a2,<f>), where we parameterize the lat- 
ter similarly to equation (j8[). In general, 4>o 7^ 4>i since 
the PSF-convolved ellipse need not have the same orien- 
tation as the intrinsic source ellipse. 

In principle, one can determine the parameters, 
(ai,a,2,<fi), of the intrinsic source ellipse by solving a 
nonlinear system of equations involving the PSF param- 
eters, (6i,&2,V0, an d the observed source parameters, 
(o"ij o"2, 0o )• However, because these equations are based 



on simple assumptions regarding the source and PSF pro- 
files, and because the input parameters are often uncer- 
tain, such an elaborate calculation seems unjustified. 

A much simpler and more robust approach makes use 
of the identity 



2 2 

o~ 2 = a l 



b\ + b 2 2 , 



which applies to the convolution of two elliptical Gaus- 
sians having arbitrary relative sizes and position angles. 
Using this identity, one can define a root- sum- square in- 
trinsic source size, 



1 / 2 



2\V2 



: -= max [0, (a 2 + <7. 



1/2 



{b\ + bDY'\ (io) 



that depends only on the sizes of the relevant ellipses 
and is independent of their orientations. This expression 
is analogous to the well-known result for convolution of 
one-dimensional Gaussians and for convolution of circu- 
lar Gaussians in two dimensions. The factors of l/v2 
ensure that the statistic value gives the radius of the 
source image when applied to circular source images. 

Using equation (fT0|) . one can derive an analytic ex- 
pression for the uncertainty in a rss in terms of the mea- 
surement errors associated with o~i and &;. Because at 
and bi are non-negative, evaluating the right-hand side 
of equation (JTUJ) using the corresponding mean values 
should give a reasonable estimate of the mean value of 
a Iss . A Taylor series expansion of the right-hand side of 
equation (|10p evaluated at the mean parameter values 
therefore yields the uncertainty 

Aa rss = 

^ [a? (Aar) 2 + a 2 (A<7 2 ) 2 + 6 2 (A6r) 2 + 6 2 (A6 2 ) 2 ] ^ , 

where (AX) 2 represents the variance in X, and where 

a rss > 

&rss — U. 



vw 



H 



3.6.3. Combining Intrinsic Source Extent Estimates from 
Multiple Observations 

Measurements of the mean intrinsic source extent de- 
rived from multiple independent observations, a rsSi i ± 
Aa rsSj i, are combined using the multivariate optimal 
weighting formalism, equations ([B]). The minimum vari- 
ance estimator of the intrinsic source size is the variance- 
weighted mean, 

a rss = Var[a rss J y ^ Var[a rsS] jJ a r ss,i, 

i 

where Var[a rss ,i] = (Aa rss i) 2 . The variance in a^ is 



Var[a r 



£Var[ 



3.7. Aperture Photometry 

Net source counts, count rates, and photon and en- 
ergy fluxes for point sources are computed from counts 
and exposure data accumulated in independent source 
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and background apertures, R s and Rb. Typically, these 
apertures are simple elliptical regions and surrounding el- 
liptical annuli, but arbitrary areas from either aperture 
may be excluded to avoid contamination from nearby 
sources, or missing data due to detector edges. The net 
aperture areas, A s and Ab, and the fractions, a and p, of 
source counts expected in both apertures are determined 
from 



A„ 



dxdy ; Ab 



dxdy 



Ih 



and 



dxdyPSF(x,y); (3= / dxdy PSF(x,y). 
R s JRb 

We use the PSFs described in § 3.6.1 to estimate a and 
P. Although the finite number of PSF counts leads to 
some uncertainty in the estimate for /?, the effect of this 
uncertainty on the derived net counts, count rates, and 
fluxes is small (typically <1%). 

If a uniform background over the scale of R s and Rb 
is assumed, then the net source counts with aperture 
corrections applied, S, can be determined by solving the 
simultaneous set of linear equations 



C = aS + b; B = (3S + rb, 



(12) 



where C and B are the total counts in R s and Rb, respec- 
tively, b represents the background in R s , and r = Ab/A s . 
The solution is 

S=(rC-B)/(ra-f3). 

In general, selecting a background aperture so that 
P — > is difficult, since the inner radii of such annuli 
could range from ~25" to > 1000", depending on 9 and 
energy. Such large background apertures would be sub- 
ject to errors due to intrinsic background variations, dif- 
fuse source emission, and background contributions from 
multiple detector chips. We choose rather to use smaller 
apertures, containing ~ 5-10% of the source flux (see 
Fig. 14), whose effects can be modeled more accurately. 

To apply a consistent statistical approach in determin- 
ing confidence bounds for all photometric quantities (see 
below), we assume that the generic photometric quan- 
tity S (whether counts, count rate, photon flux, or energy 
flux) can be converted to counts by multiplying by appro- 
priate generic conversion factors / and g defined below, 
averaged over R s , Rb, and generalize equations (|T2"j) to 
include these terms, 



C = fS + b; B = gS + rb. 



(13) 



For example, if S represents a count rate, / = a(T s ) 
and g = P(Tb), where (T s ) and (T&) represent average 
exposure times in R s and Rb, respectively. The corre- 
sponding definitions for photon flux are / = a(E s ) and 
g — P(Eb), where (E s ) and (Eb) are the average exposure 
map values (in cm 2 s) , computed at the monochromatic 
effective energy of the band, in R s and Rb, respectively. 
For energy flux, / = a/(F s ) and g = P/(Fb), where (F s ) 
and (Fb) represent average fluxes (in erg cm" 2 s" 1 ) in R s 
and Rb, respectively. The values (F s ) and (Fb) are deter- 
mined by applying quantum efficiency and effective area 
corrections to individual event energies in R s and Rb, 
and computing the averages of the resulting quantities. 



Finally, we relax the assumption of uniform back- 
ground over R s and Rb by defining r = Ab(Tb) / A S (T S ) 
for source rate, r = Ab(Eb)/A s (E s ) for photon flux, and 
r = Ab(F s )/A s (Fb) for energy flux. With these defini- 
tions, the general solution for S may be written as 



S=(rC-B)/(rf-g). 



(14) 



To determine confidence bounds for S, the background 
marginalized posterior probability density is computed 
first, with the assumption that C and B are Poisson- 
distributed random variables whose means are 9 — fS+b 
and 4> — gS + rb, respectively. The posterior probability 
density for S may then be written 

p(S\CB) = / dbp(Sb\CB). 
Jo 

To determine p(Sb | CB), we use B ayes' Theorem to write 
the joint posterior probability density for p(9(f>\CB), tak- 
ing advantage of the fact that R s and Rb are independent: 



p(04>\CB) 



P (e)p(c\d) P (4>)p(B\<t>) 



p(CB) 
The likelihoods are simple Poisson probabilities, 



p(C\9) 



p(B\<j>) 



r(c + i)' 1 ' K ~ , ' rj r(B + i)' 

and we use generalized 7-priors for p(9) and p(4>): 



p(9) 



pl^^s^e-P 30 



r(7T S ) 



p&) = 



Pb 



K*b-1, 



-PB<j 



IVb) 



where the parameters its, PSi n B, and pb define the 
function shapes, and P(CB) is determined through nor- 
malization of p(9<p\CB). Once p(9<f>\CB) is known, 
p(Sb\CB) may be found from simple substitution of vari- 
ables, 



p(9(j>\CB)d9dcj)=p(9(S, b)(j)(S, b)\CB) 



3(9, cp) 



d(S,b) 
p(Sb\CB)(rf - g)dSdb. 



dSdb 



Details of the derivation may be found in V. Kashyap & 
F. A. Primini (2010, in preparation), but here we merely 
cite the final results under the additional assumption of 
non-informative priors ns = ttb = 1 and p$ = Pb =0: 



p(S\CB)dS = dS(rf 



C B 



gS 



fc=0 j=Q 



(fS) k e-f s (gS)ie- 
T(k + 1) T(j + 1) 



x e 
x e 



(B-j)ln(r)+ln(T(C+B-k-j+l)) 
-ln(r(C-k+l))-ln(T(B-j+l))-(C+B-k-j+l)ln(l+r) 



Because of the computationally intensive nature of this 
expression, p(S\CB)dS is approximated with an equiva- 
lent Gaussian distribution when C + B > 50 counts. The 
validity of this approximation is verified through simula- 
tions. Examples of p(S\CB) for three CSC sources are 
shown in Figure 20. 

By using the different definitions for / and g as de- 
scribed above, probability densities for net counts, rates, 
and photon and energy fluxes can then be computed. 

Confidence bounds are determined by numerically in- 
tegrating p(S\ CB) in alternating steps above and below 
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Fig. 20. — Probability distributions p(S\CB) for the number of net source counts S in the ACIS broad (b) band for three different CSC 
source observations. C and B are the total counts in the source and background region apertures, respectively; a and are the fractions 
of the source counts expected in each aperture; and r is the ratio of the background to source region aperture areas. p(S\CB) is computed 
as described in the text. Left: O = 7, B = 41, a = 0.83731, = 0.13587, r = 23.41 (6 band flux significance = 1.94). Center: C = 11, 
B = 26, a = 0.89741, = 0.08607, r = 16.36 (6 band flux significance = 3.04). Right: C = 90, B = 10, a = 0.89832, = 0.07743, 
r = 24.02 (b band flux significance = 9.4). 



its mode until the desired confidence level is achieved. 
The values of S at these points then determine the con- 
fidence bounds. If the value of S = is reached before 
summation is complete, or if the mode itself is 0, integra- 
tion continues for S above the mode, and the resulting 
bound is considered an upper limit. 

We note that in our approach, photon flux and en- 
ergy flux are determined somewhat differently. While 
average exposure map values are used when computing 
photon flux, in the case of energy flux the values (F s ) 
and (Ff,) are determined by applying quantum efficiency 
and effective area corrections to individual event ener- 
gies. For sources with few counts in either the source or 
background region, a single photon detected at an un- 
characteristically low or high energy (where the Chandra 
effective area is small) can make a dominant contribu- 
tion to the estimated energy flux. In such cases, the 
true uncertainty will be significantly larger than our es- 
timated errors. A post facto comparison of energy flux 
estimates computed in this manner with energy fluxes 
calculated using an assumed canonical power-law spec- 
tral model (see § 3.10, below), indicates that fewer than 
1% of ACIS broad energy band fluxes are affected by this 
problem. A more detailed analysis of the statistical ac- 
curacy of the energy flux determinations is provided by 
F. A. Primini et al. (2010, in preparation). 



3.7.1. Determining Flux Significance 

Significances for all aperture photometry quantities 
are determined directly from the probability densities 
p(S\CB). Our goal is to provide a simple statistic that 
is robust to calculate, easily interpretable by non-expert 
users, and consistent with the classical SNR definition 
for high count sources. To this end, we compute the 
FWHM of p{S\CB), since the latter has a well-defined 
width even for low-significance sources in the catalog, as 
shown in Figure 20. If S — is reached before the half- 
maximum point below the mode is found, the HWHM 
is computed from values above the mode and FWHM is 
set equal to 2 x HWHM. The FWHM is then used to 
compute the "equivalent <r" for a Gaussian probability 
density, 

FWHM 



The flux significance value that is reported in the catalog 
for a source is defined to be SNR = S/a e , where S is 
determined from equation (fT4|) . This value must be at 
least 3.0 in at least one energy band for an observation of 
a source to be included in the first release of the catalog. 
The flux significance threshold that we use imposes 
a conservative limit on sources included in the CSC, 
which we deem necessary to reduce the number of spu- 
rious sources at low count levels to an acceptable value. 
Comparing our results to those of other large Chandra 
surveys whose source lists are derived from wavdetect, 
but whose detection procedures differ, is useful. In Fig- 
ure 21, we compare the distribution of net counts for 
CSC sources detected in the ACIS broad (0.5-7.0 keV) 
energy band with distributions of similar quantities for 
four other Chandra catalogs derived from a range of 
AC IS exposures comp arable to those in the CSC: AEGIS- 
X (lLaird et all I2009L 0.5-7.0 keV), the Galactic Cen- 
ter catalog (jMuno et al; 2009, 0.5-8.0 keV), C -COSMOS 
(lElvis et al.ll2009l 0.5-7.0 keV). and ChaMP (|Kim et al.l 
120071 0.5-8 OkeVV We note that while these other cat- 
alogs do include sources with fewer net counts than the 
CSC, the additions are in general not large, compris- 
ing - 5%, - 9%, ~ 9%, and - 24%, for AEGIS-X, the 
Galactic Center catalog, C-COSMOS, and ChaMP, re- 
spectively. We attribute the larger percentage in ChaMP 
to the restricted fields-of-view and the careful manual 
screening of source detections used when constructing 
that catalog. The CSC appear s to fare worse in com- 
parison to the XBootes survey (jKenter et al. 20051 0.5- 
7.0 keV), most of whose sources have fewer than 10 net 
counts. However, the XBootes survey is composed of 
many 5 ks non-overlapping observations for which the 
very low ACIS background allows a lower count thresh- 
old. In contrast, the CSC is constructed from obser- 
vations comprising a wide range of exposures, ~ 70% 
of which are greater than 5 ks and ~ 10% of which are 
greater than 50 ks. Finally, as mentioned in § 3.4, ~l/3 
of all sources detected by wavdetect in the ACIS broad 
energy band fall below the flux significance threshold. 
However, we expect that a substantial fraction of these 
sources are spurious. 
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Fig. 21. — Comparison of the distribution of net counts for CSC 
sources detected in the ACIS broad energy band with distributions 
of similar quantities for four other Chandra catalogs derived from 
a range of ACIS exposures comparable to those in the CSC. 



3.7.2. Combining Aperture Photometry from Multiple 
Observations 

Ideally, one should compute aperture photometry 
quantities for combined observations by computing a 
joint probability density, using p(S\CB) from one obser- 
vation as the prior for the next. However, this approach 
is difficult to implement and computationally expensive, 
especially when probability densities from individual ob- 
servations do not overlap significantly. We have there- 
fore chosen simply to combine aperture data from various 
observations and compute a single p(S\ CB) from those 
data. We compute 






Cfc = SX)/i + X>;X;Bi = SX> + 5>i&i. 



To cast these in the same form as equations (TT3"1) . we 
define r' — 'Ylii r i^ ) il Xi^> where the bi are determined 
from the solutions to equations (|13[) for individual obser- 
vations. One can then write 



E 



c, 



5 X> +$>;!>< = sX>+r / $> 



which are identical in form to equations (|13p . The com- 
bined aperture photometry quantities and bounds can 
then be determined as described earlier. 

3.8. Computing Limiting Sensitivity 

For the purposes of the catalog, "limiting sensitivity" 
is defined to be the flux of a point source that meets but 
does not exceed the flux significance threshold for inclu- 
sion in the catalog. Limiting sensitivity is a function of 
source position, background, and the algorithm used to 
calculate flux and flux significance. At any point within 
the field of view of an observation, the limiting sensitiv- 
ity can be used as a simple X-ray flux limit for individual 
sources detected at other wavelengths. For the catalog, 
a full-field sensitivity map is provided for each observa- 
tion and energy band as a file-based data product (see 
Table 3). These data are also required to calculate sky 
coverage histograms (solid angle surveyed as a function 
of limiting flux), which are themselves needed to calcu- 
late luminosity functions and source surface brightness 
versus number density relationship. 



As described above, the flux significance of a cata- 
log source is defined to be the ratio of the source flux 
to the equivalent a determined from the width of the 
flux's posterior probability density. There is no equiva- 
lent quantity for sensitivity, and for simplicity and ease 
of comp utation, we use a t echniq ue similar to that devel- 
oped by iMaccacaro et "ahl ()1982f ) for the Einstein Obser- 
vatory Medium Sensitivity Survey, namely, we approx- 
imate significance using the aperture photometry rela- 
tions of § 3.7, under the assumption of Gaussian statis- 
tics, and use the model background maps, randomized 
to provide statistics appropriate to the observation in 
question, to determine aperture counts. 

Recall from equation (fT4)) that the flux may be written 
as 

S=(rC-B)/(rf-g). (15) 

Since C and B are independent random variables, the 
variance on S may be written 

, r 2 a 2 + a 2 r 2 C + B 



* (rf-g) 2 (rf-g) 2 ' 

assuming Gaussian statistics. The significance, S/o~s 
may then be written as 



S/a s 



(rC - B) 
Vr 2 C + B' 



(16) 



The limiting sensitivity is found by determining the 
minimum number of counts C m i n in the source aper- 
ture that yields the flux significance threshold SNR m [ n 
in equation (Ti6"|) . 

(rC min - B) 



SNR m i n 
whose solution is 



%/^a 



B 



rC m in — B 



rSNR 




and the limiting sensitivity for that aperture is then given 
by equation (fT5")l , 



Smin = (rCmin - B) / ' (r f - (?) 

ri 



"SNRl, 



1 + 4/1 



4B 



rSNRi 




x (rf - g)~ 



DiVxl m £ n J / 



AB 



2/ [ V rSNR 
where we have approximated 




(rf - g)- 1 * (rf)- 1 1 + -^ \ * (rf) 



g_ 

rf 

Since the limiting sensitivity maps are computed from 
background maps with no real sources, information 
about real source apertures is unavailable. Rather, for 
each element in the map, circular source and annu- 
lar background apertures appropriate to the 90% ECF 
source aperture at that location are constructed, and 
used to determine B, r, and / for use in equation (fl7|) . 
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The assumption of Gaussian statistics, and the subse- 
quent simplification in the algorithm, is made of neces- 
sity, since limiting sensitivity must be computed not for 
each source but for each pixel in each of 5 energy band 
images. We have, however, verified the performance of 
the algorithm by comparing detected source fluxes with 
values of limiting sensitivity at the source locations, for 
thousands of catalog sources in all energy bands (F. A. 
Primini et al. 2010, in preparation), and find good agree- 
ment. 

3.9. Spectral Model Fits 

For observations of sources with at least 150 net counts 
in the energy band 0.5-7 keV obtained using the ACIS 
detector, we further characterize the intrinsic source 
properties by attempting to fit the observed counts spec- 
trum with both an absorbed black-body spectral model 
and an absorbed power-law spectral model. These two 
models represent basic spectral shapes of thermal and 
non-thermal X-ray emission. 

The standard forward fitting method used in X-ray 
spectral analysis computes the predicted counts pro- 
duced by the spectral model with the observed counts 
in the detector channel space, and iteratively refines the 
model parameters to improve the qu ality of the fit. 

Instrumental response functions (Davis 2001a) de- 
fine the mapping s between phy s ical ( source) space and 
detector space. iGeorge et al.l (j2007t ) describe two of 
these calibration files, the detector redistribution ma- 
trix file (RMF) and the ancillary response file (ARF). 
The former specifies the energy dispersion relation 
R(E',p';E,p,t) that defines the probability that a pho- 
ton of actual energy E, location p, and arrival time t will 
be observed with an apparent energy E' and location 
p' , while the instrumental effective area A(p'; E,p,t) is 
recorded in the latter. The final dispersion relation is the 
photon spatial dispersion P(p'; E,p,t) transfer function 
due to the instrumental point spread function. 

With these definitions, the model M(E',p',t) that de- 
scribes the expected distribution of counts arriving at the 
detector is then 

M(E',p',t)= (18) 

JdEdpR(E';E,p,t)P{p';E,p,t)A(E,p',t)S(E,p,t), 

where S(E,p,t) is the physical model that defines the 
physical energy spectrum, spatial morphology, and tem- 
poral variability of the source. 

We follow standard practice by ignoring the depen- 
dency on photon arrival time, and instead consider only 
the total number of photons that arrived during the ob- 
servation in the forward fitting process. The source posi- 
tion and shape are taken as known, and we assume that 
the source photons are collected from the detector area 
containing an entire source region of interest. The latter 
assumption is valid provided that sources are spatially 
separated on scales of order the size of the PSF or larger. 
In crowded fields, or for sources that have a complex dif- 
fuse structure, the contribution from the other sources 
are important. With the assumptions listed above, equa- 
tion (|18p reduces to 

M(E')= J dER(E';E)A(E)S(E), 



where the source emitted spectrum S(E) depends on the 
source physics. The forward fitting procedure solves for 
the best fit parameters for S(E), assuming a pre-defined 
fit statistic. Since spectral fitting is only performed for 
sources with a minimum of 150 net counts, a \ 2 At statis- 
tic is used, but note that this assumes a Gaussian distri- 
bution for the source counts. 

For all sources observed using the ACIS detector (i.e., 
not just those with at least 150 net counts in the broad 
energy band), the catalog processing pipelines extract 
the observed energy spectra of the photons included 
in the source and background regions of each detected 
source and store these i n a standard format (PHA file; 
lArnaud fe George] 120091 ). An appropriate associated 
ARF and RMF are computed by weighting the instru- 
mental responses based on the history of how the source 
and background regions move over the surface of the de- 
tector due to the spacecraft dither motion. The extracted 
spectra, and associated ARF and RMF are stored as file- 
based data products (see Table 3) and can be retrieved 
by the user for further analysis such as low-count spectral 
fitting or spectral stacking. 

To fit the background subtracted data, each PHA 
spectrum is grouped to a minimum of 16 counts per 
channel bin, and the source model parameters are var- 
ied to minimize the x 2 statistic (assuming data vari- 
ance, of = N. hS + (As/A B ) 2 Ni_ B )- Two models are 
applied to the data in order to evaluate source prop- 
erties: (1) an absorbed blackbody model f(E) — 
exp -JVH<r B A(E 2 /(exp E / kT -1)); and (2) an absorbed 
power law model f(E) = exp _JVHffE AE~ r . In 
these models, iVu is the equivalent Hydrogen column 
de nsity, ue is the photo-electric cross-se ction based 
on I Balucinska- Church fe McCammonl (11992ft and metal 
abundances from lAnders fc Grevessg |l989), A is the 
model normalization at E = 1 keV, kT is the blackbody 
temperature, and T is the power-law photon index. For- 
ward fitting is performed using the Sh erpa fitting engine 
(|Freeman et al.H2001l ; lDoe et al.ll2007t) . Sherpa finds the 
best fit model parameters and calculates two-sided confi- 
dence intervals for each significant parameter. The model 
flux for the best fit parameters over the energy range 0.5- 
7keV is also calculated. 

The 68% (1 a) confidence limits for each parameter 
are calculated using the "projection" method in Sherpa. 
This method finds the two-sided confidence bounds in- 
dependently for each parameter. The algorithm assumes 
that the current model has been fitted, and that all of 
the parameters are at the values corresponding to a best 
fit which is at the minimum of the fit statistic (y 2 :„). 

\* V IU1I1 / 

For each parameter of interest, the search for the lower 
or upper bound starts at the best fit value, which is then 
varied along the parameter axis. At each new value, the 
parameter of interest is frozen and a new best fit model is 
determined by minimizing yf over the remaining thawed 
parameters. The new x statistic, Xnew ^ s determined 
and the difference between the new and the minimum 
statistics, Ay 2 = X 2 GW — Xmin' i s calculated. A change in 
Ay 2 equal to 1 corresponds to a 68% confidence bound 
(|Avni|[l976l ). so the parameter of interest is varied until 
this value of A% 2 is obtained. 

We note here that energy-dependent aperture correc- 
tions are not applied when performing the spectral model 
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fits. Since the Chandra PSF is somewhat more extended 
at higher energies, the lack of correction has the effect 
of slightly softening the calculated spectral slope. The 
correction in T is approximately 0.03-0.05 for powerdaw 
spectra with a wide range of spectral indices. For sources 
included in release 1 of the CSC for which spectral fits 
have been performed, the error in T introduced by not ap- 
plying energy-dependent aperture corrections is roughly 
six times smaller than the median computed 1 a confi- 
dence limits. About 2.5% of sources with spectral fits 
have computed confidence limits < 0.05, and these cases 
appropriate caution should be exercised when using the 
spectral fit properties. 

For most source properties, values recorded in the Mas- 
ter Sources Table are computed by combining the rele- 
vant data from the set of observations in which the source 
is detected. However, for simplicity, the spectral model 
fit parameter values recorded in the Master Sources Ta- 
ble are taken directly from the single observation of the 
source that has the highest significance, equation (p~6|) . In 
this case, data from multiple observations are not com- 
bined to compute the Master Source Table spectral fit 
properties. 

3.10. Spectral Model Energy Fluxes 

Spectral model fits are not performed for sources with 
< 150 net counts. However, for all sources we estimate 
energy fluxes using canonical absorbed power-law and 
black-body spectral models. 

For a canonical source model S(E) whose integral over 
the energy band is S", an corresponding band count rate, 
C" in counts s _1 , can be computed from the effective area 
calibration, A(E), and the RMF, R(E',E). The count 
rate is / dE R(E'; E) A(E) S{E), where the integral is 
performed over the energy band. For HRC observations, 
a diagonal RMF is assumed. The actual flux of a source 
can be estimated from S' by scaling the latter by the ratio 
of the measured and modeled source aperture count rates 
in the energy band. Since the only free parameter in this 
case is the normalization of the model, the calculation 
can be performed for sources with too few counts for a 
reliable spectral fit. 

The canonical power-law spectral model has a fixed 
photon index T = 1.7, which falls in the range of 
valu es (r ~ 1.5-2.5) that are ty pical of AGN spec- 
tra (Ushibashi fc Courvoisierl I2010D . The value chosen 
matches the photon index used to convert count rates to 
energy fluxes in the sec ond XMM-Newton s erendipitous 
source catalog f2XMM; IWatson et al.ll2008f) . to simplify 
comparison of CSC and 2XMM source fluxes. Since we 
anticipate that the majority of sources with spectra that 
are best fit by a power-law model are AGN, we fix the to- 
tal neutral Hydrogen absorbing column Ah equal to the 
Galactic column, Ah (Gal), under the assumption that 
this represents a lower limit to the true column density. 

The canonical black-body spectral model has a fixed 
temperature kT — 1.0 keV and total neutral Hydrogen 
column density Ah = 3 x 10 20 cm~ 2 . The latt er value 
match es the median column density identified bv lSaxtonl 
(2003), and also corresponds to the typical col umn den- 
sity found within lkpc of the Sun (JLiszt 1983|). As dis- 
cussed by iMcColloughl (|2010l) , the choice of black-body 
temperature is a compromise between the possible ranges 
of values for different classes of thermal X-ray emitters. 



Sources for which a thermal model best represents the 
data will likely lie in our galaxy, and so in this case set- 
ting Ah = Ah (Gal) would overestimate the total neutral 
Hydrogen absorbing column. 

Similar to spectral model fits, master source spectral 
model energy fluxes are taken directly from the single ob- 
servation of the source that has the highest significance. 

3.11. Spectral Hardness Ratios 

While the spectral model fits described in § 3.9 provide 
detailed information about a source's spectral properties, 
only about 10% of the source observations included in 
the CSC have sufficient net counts to perform the fitting 
process. As an aid to characterizing the spectral proper- 
ties of the remaining catalog sources, hardness ratios are 
computed between the hard, medium, and soft energy 
bands for all sources observed with the ACIS detector. 

The spectral hardness ratio for the pair of energy bands 
x and y is defined as 



tLI\-xy 



F — F 

± x ± y 

F b 



(19) 



where F x and F y are the photon fluxes measured in the 
energy bands x and y respectively (a: is always the higher- 
energy band of the pair) , and F b is the photon flux in the 
ACIS broad energy band, F b = F h + F m + F s . 

A catalog source may be readily detected in one or 
more energy bands, but remain undetected or include 
very few total counts in other bands. Since hardness ra- 
tios are cross-band measures, a technique that applies 
rigorous statistical methods in the Poisson regime is re- 
quired to compute these values and their associated con- 
fidence limits robustly. The hardness ratios included in 
the CSC are computed usin g a Bayesian approach de- 
veloped bv lPark et al.l (|2006f) . which should be consulted 
for a detailed description of the algorithm. 

To ensure that the Poisson errors are propagated cor- 
rectly, the conversion between counts and photon flux for 
each energy band is modeled as a linear process, with a 
scale factor that is determined from the effective area of 
the telescope/instrument combination computed at the 
monochromatic effective energy of the band. This im- 
plies that the photon fluxes in equation (|19[) may not 
match exactly the aperture photometry fluxes derived in 
§3.7. 

Specifically, we model the observed total and back- 
ground counts, C x and B x , in the hard, medium, and 
soft ACIS energy bands as 

C a; -Poisson[e 2; (A a; + £ x )], 
B a; ~Poisson|re x £ ( :], 

where x represents the energy band (one of h, m, or 
s); Aa; and £ x are the expected source and background 
counts intensities, respectively; e x are the conversion fac- 
tors that scale counts to photon fluxes; and r is the ratio 
of the background aperture area to the source aperture 
area. 

With these definitions, the spectral hardness ratio for 
the pair of energy bands x and y is determined by com- 
puting the expectation value 

A K — A„ 



rLK X y — ^ 
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TABLE 6 
Intha-Observation Variability Indices 



Variability 
Index 



Condition 3 



Meaning 



pgl < 0.5 

1 0.5 < pgl < 0.667 AND f 3 > 0.997 AND f 5 = 1.0 

2 0.667 < pgl < 0.9 AND f 3 > 0.997 AND / 5 = 1.0 

3 0.5 < pgl < 0.6 AND (/ 3 < 0.997 OR / 5 < 1.0) 

4 0.6 < pgl < 0.667 AND (/ 3 < 0.997 OR f 5 < 1.0) 

5 0.667 < pgl < 0.9 AND (/ 3 < 0.997 OR f 5 < 1.0) 

6 0.9 < pgl AND O < 2.0 

7 2.0 < O < 4.0 

8 4.0 < O < 10.0 

9 10.0 < O < 30.0 

10 30.0 < O 



Definitely not variable 
Not considered variable 
Probably not variable 
May be variable 
Likely to be variable 
Considered variable 
Definitely variable 
Definitely variable 
Definitely variable 
Definitely variable 
Definitely variable 



a pgl is the Gregory-Loredo variability probability, equation (22); / 3 and /s are the 
fractions of the light curve that fall within 3 a and 5 a of the average rate, respectively; 
and O = ^"^™ ax Oj is the sum of the odds-ratios, equation (21), for two or more bins. 



Following the lead of iPark et al.l (|2006[ ). the joint pos- 
terior probability distribution can be written as 

p{X s , A m , Xh\C s ,C m ,Ch, B s , B m , Bh) = 

P (X s \C s ,B s )p(X m \C m ,B m )p(X h \C h ,B h ), 

where we have made use of the fact that the X x are inde- 
pendent. Marginalizing over nuisance variables yields the 
posterior distributi on for the hardn ess ratios (equivalent 
to equation [14] of IPark et aTll2006fl : 

p('HlZ X y\C s ,C rn ,Ch,B s ,B m ,Bi l )a"HlZ X y = 



dHlZxy 



1JJ,UJ 



drb duj 



x p(WZ xy ,ip,uj\C s ,C m ,C h ,B s ,B m ,B h 



where ip — X x + X y and uj = X s + X m + Xh- 

The spectral hardness ratios that are included in the 
CSC are determined separately for each observation in 
which a source is detected, and also from the ensemble of 
all observations of the source. The former quantities are 
recorded in the Source Observations Table, while the lat- 
ter are recorded in the Master Sources Table. The prior 
probability distributions used to derive the Bayesian pos- 
terior probabilities are computed differently in these two 
cases. 

For a single obse rvation, non-informa tive conjugate 7- 
prior distributions (|van Dvk et al.ll200lD are used for the 
source and background intensities. These distributions 
ensure that the posterior probabilities conjugate to the 
expected Poisson distributions of counts with no other 
prior information. When multiple observations are com- 
bined, the ensemble hardness ratios are computed by 
stepping through all of the observations of a source in 
order of increasing net broad-band source counts. The 
posterior probability distribution computed from each 
observation is used as the prior probability distribution 
for the subsequent step. If the propagated prior prob- 
ability distribution is not consistent with the observed 
counts in any step, then a conjugate 7-prior is used in- 
stead, and a catalog flag is set to indicate that the source 
spectrum is variable. 



3.12. Estimating Source Variability 

The CSC includes estimates of the probability that the 
flux from a source is temporally variable both within 
a single observation and between two or more observa- 
tions in which the source was detected. These estimates 
are distinguished not only by the fact that they mea- 
sure variability on different time scales, but also because 
their definitions differ fundamentally. Within an obser- 
vation, we measure the probability that the source flux 
is not consistent with a constant level during a (largely) 
continuous observation, which is equivalent to estimat- 
ing the probability that the source is variable, and is a 
positive statement with respect to variability. The inter- 
observation variability estimates measure the probability 
that the average flux levels during the different observa- 
tions are consistent with a uniform source intensity. This 
provides only a lower limit to the probability of the source 
being variable, since we have no information about the 
source's behavior during the gaps between the observa- 
tions, which are often widely separated. 

The intent of the various variability measures included 
in the CSC is to provide users a means to easily select 
potentially variable sources. The individual source light 
curves or event lists should be assessed to reveal the true 
nature of the source's temporal characteristics. Moder- 
ately intense background flares that are not rejected as 
part of the enhanced background event screening (see 
§ 3.2) may cause sources to be incorrectly identified as 
variable. This possibility can be evaluated by comparing 
the structure of the source and background light curves. 

3.12.1. Intra- Observation Variability 

The probability that a source is variable is estimated 
separately in each energy band using the Gregory- 
Loredo and Kolmogorov-Smirnov (K-S) algorithms, and 
Kuiper's variation on the latter. A brief description 
of each of the three algorithms is provided below. 
Gregory-Loredo probabilities are used to construct intra- 
observation variability indices that provide a shorthand 
measure of variability. 

All three algorithms directly use the photon event ar- 
rival times to compute the variability probabilities, and 
apply corrections for variations of the geometric areas of 
the source and background region apertures due to the 
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spacecraft dither-induced motion during the observation. 
The latter corrections are necessary since a source region 
that is moving across the edge of the detector or over a 
bad detector region might otherwise be erroneously clas- 
sified as variable. Optimal resolution light curves are 
generated as by-products of the Gregory-Loredotest, and 
their power spectra are evaluated for the presence of the 
fundamental spacecraft pitch and yaw dither frequencies 
or associated beat frequencies. If there is a peak in the 
power spectrum at one of these frequencies that is at 
least 5x the RMS value, then a catalog warning flag is 
set for the source observation to indicate that the intra- 
observation varia bility propertie s are unreliable. 

The K-S test (|Massevl 1 1 9 5 If ) is a familiar and well 
established robust test for comparing two distributions 
that are a function of a single variable. In the simplest 
case we compare the cumulative sum of photon events, 
as a function of time, against a linearly increasing func- 
tion that represents a constant flux. This null-hypothesis 
function is modified as necessary to account for data gaps 
and variations in effective area. 

For an observation with N events, let Sjv(t) be the 
cumulative sum of detected events as a function of time 
t, and P(t) the cumulative function that represents a 
constant flux. The K-S statistic Dm is defined as 

D N =su V \S N (t)-P(t)\. 
t 

The K-S derived probability that the two distributions 
5iv(t) and P(t) do not belong to the same population, 
and therefore that the source is variable, is given by 



for detecting periodic signals, it is a perfectly suitable 
method for detecting random variability by forcing the 
period to equal the length of the observation. 

Briefly, the Gregory-Loredo algorithm bins the N ob- 
served photon events into a series of histograms con- 
taining to bins, where m runs from 2 to m max . If 
the observed distribution of events across the to his- 
togram bins is m, ri2, ... , n m , then the probability that 
this distribution came about by chance can be deter- 
mined from the ratio of the multiplicity of the distri- 
bution, JV!/(ni! • TI2I ■ ■ ■ n m l), to the total number, m N , 
of possible distributions. The inverse of this ratio is a 
me asure of the significance o f the distribution. Follow- 
ing [Gregor^&Uoredc] (|1992D . we calculate an odds ratio 
O m for to bins versus a flat light curve as 



Or, 



T 



M (to - 1)! Sr, 



,N 



(N 



D! W„ 



(21) 



where we have rewritten the multiplicity of the distribu- 
tion, W mi as 

N\ 
W„ 



n 



i=i«i' 



Data gaps are accounted for through the binning facto r, 
S m , which is (Appendix B of iGregorv fc Loredolll992l ) 



n 



where 



where 



Pvar = QksWND N ) 



) KS (A) = 2^(-ir 1 , 



(20) 



Equation (J20I) is strictly valid only in the asymptotic 
limit as N — ► 00. In practice N > 20 is "large enough," 
especially if conservative significance levels < 0.01 are 
r equired ( e.g.JPress et al.l I1986D . 

iKuiperl (|1962|) proposed a variation on the K-S test 
that involves replacing the expression for Dn by the dif- 
ference between the largest positive and negative devia- 
tions, 



D 



N 



sup[5jv(*) 
t 



P(t)} 



ini[S N (t) 



P(t)} 



Folding this expression into equation (|20l) yields the 
Kuiper derived probability that the source is variable. 
While the K-S test is primarily sensitive to differences 
between the median values of the cumulative distribu- 
tion functions, the Kuiper test statistic is as sensitive 
to differences in the tails of the distributions. In many 
cases, this makes the Kuiper test a more robust variation 
of the traditional K-S test for evaluating the probability 

that a source is variable. 

The Gregory-Loredo test ([Gregory fc Loredolll992h is 
based on a Bayesian approach to detecting variability. 
The method works very well on photon event data and 
is capable of dealing with data gaps. We have incor- 
porated the capability to include temporal variations in 
effective area. Although the algorithm was developed 



J T/m' 

tj is the amount of good exposure time in bin j, and T 
is the total good exposure time for the observation. The 
odds are summed over all values of to > 2 to determine 
the odds that the source is time- variable. m max is chosen 
for each case in such a way that the odds ratios corre- 
sponding to higher values of to contribute negligibly to 
the total. The probability, p mi of a particular binning, 
to, is simply 



Pn 



Oral Y, J 
3=1 



Summing over bins to > 2 corresponding to a non- 
constant source flux yields the Gregory-Loredo variabil- 
ity probability 



Pgl = Y p i 
i=2 
O 

~i + o' 



(22) 
(23) 



On , and we have made use of the fact 



'j- 



where O — J2T=2 
that Oi = 1. 

The Gregory-Loredo algorithm bins the events into a 
series of light curves of varying resolution, corresponding 
to the number of bins, to, in the range 2 to TO max . Using 
the definitions above, the bins that comprise the normal- 
ized light curve, h m , associated with a specific value of 
to are 



8j N' 



Chandra Source Catalog 



43 



TABLE 7 
Inter-Observation Variability Indices 



Variability 


Reduced \ 2 




Index 


2 Observations 


> 2 Observations 





< 0.4 




<0.8 


3 


> 0.4 < 0.7 


> 0.8 


< 1.0 


•1 


> 0.7 < 1.0 


> 1.0 


< 1.15 


5 


> 1.0 < 2.7 


> 1.15 


< 2.1 


6 


> 2.7 < 7.0 


> 2.1 


< 3.8 


7 


> 7.0 < 12.0 


> 3.8 


< 5.5 


8 


> 12.0 


> 5.5 













and the corresponding standard deviations derived from 
the posterior distribution are 



8j flj,m \-L 



Sj 'lj,m) 



N 



1 



As described bv lGregorv k, Loredol (J1992D . an optimal 
resolution, light curve, h, can be obtained by combining 
the individual light curves, h m , weighted by the proba- 
bilities, p m : 



h = (1 -p G h)hi + 



j=2 



The optimal resolution light curve computed from the 
events included in the source region aperture for each 
source is recorded as a file-based data product (see Ta- 
ble 3) that is accessible through the catalog. As well 
as the light curve, h, this data product includes the un- 
certainty, a, and upper and lower confidence intervals, 
h — 3a, and h + 3 a, respectively. To allow the users 
to verify the significance of features that may be present 
in the light curve, the file also includes the correspond- 
ing quantities derived from the events extracted from the 
background region aperture, using the same binning. 

Careful judgement should be applied when assessing 
the reliability of source variability indicators using the 
source and background light curves. Since the back- 
ground region aperture may contain up to ~ 10% of 
the source flux (see § 3.4.1), the background and source 
light curves may appear similar for very bright sources. 
The PSF wings of unrelated but nearby strongly vari- 
able sources may contaminate both the source and back- 
ground region apertures of the source being investigated. 
An observation may have experienced background vari- 
ations intense enough to be noticeable when compared 
to the target source's count rate, but not strong enough 
to have been removed by background screening during 
observation recalibration. In the first case the source is 
truly variable, but this is not necessarily so in the lat- 
ter two examples. A helpful, though not definitive, test 
is to scale the amplitude of the flux variations in the 
source and background region apertures by their respec- 
tive areas (recorded in the FITS keyword APERTURE). If 
the variations of the two scaled amplitudes are similar, 
then there is a good chance that a background problem 
is responsible. If the source region scaled amplitude is 
considerably larger than the background region scaled 
amplitude and the source is strong, then one is likely to 
have a truly variable source. 



The Gregory-Loredo test appears to provide a more 
uniform and reliable measure of variability than either 
the K-S or Kuiper tests, although the Gregory-Loredo 
algorithm is more "conservative" than the other tests. 
In cases where the K-S and/or Kuiper tests detect vari- 
ability, but the Gregory-Loredo test does not, close in- 
spection of the light curve often, but not always, demon- 
strated that the level of variability does not exceed the 
3 a bounds on the light curve. In cases where there are 
considerable data gaps, Gregory-Loredo is not always be 
able to detect variability on time scales comparable to 
those gaps. 

To provide the user with a short-hand measure of vari- 
ability that allows selection of sources on different de- 
grees of confidence, the CSC includes a set of integer 
"variability index" values in the range [0, 10]. These in- 
dices are based on a combination of the Gregory-Loredo 
probability, J»gl, the logarithm of the odds ratio, O, and 
a secondary criterion that addresses the overall devia- 
tion of the light curve from the mean value. The latter 
criterion is based on the parameters f^ and f$, which 
are the fractions of the light curve that falls within 3 a 
and 5 a of the average rate, respectively. Table 6 defines 
the mapping of the test parameters to variability index 
values. 



3.12.2. Inter- Observation Variability 

Inter-observation variability is based on comparison of 
source region aperture photon fluxes, and their confi- 
dence intervals, from multiple observations in which the 
source is detected. The catalog provides a probability 
that the data are not consistent with a constant-flux 
source, as well as an inter-observation variability index 
that is similar to the index defined for intra-observation 
variability. These measures of variability are assessed 
for each spectral energy band independently, and con- 
sequently no cross-instrument comparison is performed. 
In the first release of the CSC, observations that cover 
the same region of the sky, but in which the source is 
not detected, are not considered when computing inter- 
observation variability. These observations should enter 
into the variability assessment as flux upper limits, since 
they could conceivably be inconsistent with a constant 
source flux. A future release of the catalog will address 
this limitation. 

As mentioned above, the inter-observation variabil- 
ity probability must be interpreted differently from the 
intra-observation variability probability. Whereas the 
light curve can be used to declare a source to be variable 
or non- variable within the time range of a single observa- 
tion, one can never conclude that a source does not vary 
between multiple observations. If inter-observation vari- 
ability is detected, then the source is definitely variable; 
however the converse is not true. 

The inter-observation variability probability is simply 
based on the reduced % 2 of the distribution of the source 
region aperture photon fluxes of the individual observa- 
tions and their confidence intervals. For a source de- 
tected in n separate observations, we first use the source 
region aperture photon flux, Si, and the associated lower 
and upper 1 a confidence limits, S~ and Sf , respectively, 
to compute an initial estimate of the variance-weighted 
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TABLE 8 
Source Codes 



Property 



Bit Encoding 51 



conLcode 



edge 



multi_chip_code 




1 
2 
1 
8 

16 
32 



var_code 




1 
2 
1 
8 
16 
32 



Source is not confused 

Multiple source in source region 

Source region overlaps another source region 

Source region overlaps another background region 

Background region overlaps another source region 

Background region overlaps another background region 



Source docs not dither off detector boundary 
Source position dithers off detector boundary 
Source region dithers off detector boundary 
Background dithers off detector boundary 



Source does not dither between detector chips 
Source position dithers across 2 chips 
Source region dithers across 2 chips 
Background region dithers across 2 chips 
Source position dithers across > 2 chips 
Source region dithers across > 2 chips 
Background region dithers across > 2 chips 



fntra-obscrvation source variability not detected in any band 
fntra-observation variability detected in the AC1S ultrasoft (u) energy band 
fntra-observation variability detected in the ACIS soft (s) energy band 
fntra-observation variability detected in the ACIS medium (m) energy band 
Intra-observation variability detected in the ACIS hard (h) energy band 
Intra-observation variability detected in the ACIS broad (6) energy band 
Intra-observation variability detected in the HRC wide (w) energy band 



a Non-zero bit encodings are additive, so that (e.g.) var_code = 28 would mean the intra- 
observation variability was detected in the ACIS medium, hard, and broad energy bands. 
b "Chip" refers to either an ACIS CCD or a HRC micro-channel plate. 



mean source region aperture photon flux 

Si 



Sq 



£^ 



:/Ei 
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where we take er ,i = (S^ — S~)/2. Using this estimate 
of the mean flux, we define the "effective a" for the ith 
observation of the source as 

Si — S i Si > Do 

S i — Si Si < Sq 

(Sf-Sr)/2 S t = S . 

A refined estimate of the variance- weighted mean flux is 
then given by 



S 



^r 



s, 



i=i 



n 
i=\ ' 



and the reduced x 2 is 

,2 _ J v^ (Si - 

-1<H a. 



X 



n ■ 



(Sj - Sf 

2 



i=l 



The inter-observation variability index is assigned on 
the basis of the reduced x 2 , according to Table 7. Note 
that the values 1, 2, 9, and 10 are not used. 

3.13. Source Codes and Flags 

Each entry in both the Master Sources Table and 
the Source Observations Table includes several source- 
specific flags and codes that identify specific circum- 
stances that may be of relevance to the catalog user. 
Some flags and codes are used to encode source prop- 
erties that are commonly searched for by users, as an 



aid to simplify catalog queries. However, in most cases 
flags and codes are intended to warn the user of condi- 
tions that may degrade the quality of measured source 
properties, or that may limit the usefulness of the source 
detection for some investigations. 

The codes and flags included in the Source Observa- 
tions Table are defined in Table 1. Flags are Boolean 
quantities that describe "yes/no" or "true/false" prop- 
erties, whereas codes are multi-bit data values that en- 
code several levels of information. Translations of the 
bit-encodings can be found in Table 8. 

The extent and variability codes require additional ex- 
planation. The former encodes a conservative estimate 
of whether the intrinsic extent of a source, a rss [equa- 
tion (HHJ)], is inconsistent with the extent of the local PSF 
in each energy band. Specifically, a source is considered 
extended in an energy band if a rss > 5 Aa rss in that en- 
ergy band, where Aa rss is the uncertainty in a rss , given 
by equation (jlip . The variability code bit corresponding 
to a specific energy band is set if the intra-observation 
variability index (Table 6) > 3. A zero code therefore im- 
plies that the source is either definitely not variable, not 
considered variable, or probably not variable, depending 
on the value of the variability index. Similarly, a non- 
zero code implies that the source either may be variable, 
is likely to be variable, is considered variable, or is defi- 
nitely variable. 

The remaining codes and flags all warn of conditions 
that may affect derived source properties to some extent. 
The streak source flag, if set, indicates that the source 
detection is located on an ACIS readout streak. If the 
readout streak is associated with a bright source, then 
there is a significant probability that the source prop- 
erties may be compromised. This is particularly true of 
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TABLE 9 
ACIS Observation False Source Rate 



Fig. 22. — Example of a highly piled-up source detected using 
ACIS. The core of the image has been eroded by photon pile-up, so 
that the source has a cratered appearance (i.e., the photon density 
has an apparent minimum at the source location). Bright spots 
on the ring are detected as distinct sources, shown in red. These 
source detections are manually adjusted to include only a single 
source centered on the crater, shown in green. 

aperture photometry values. If the streak is especially in- 
tense, then the source detection may not be real. If the 
saturated source flag is set, then the source is definitely 
real, but is so bright that photon pile-up has eroded the 
core of the source image so that a single source has a 
"cratered" appearance (Fig. 22). All source properties 
are compromised. 

The Master Sources Table includes only flags, and 
these are defined in Table 2. In most cases, Master 
Sources Table flags summarize the corresponding Source 
Observations Table flags and codes for all of the source 
detections that have "unique" linkages to the master 
source. The confusion flag is an exception to this, in 
that it is set for a master source if the confusion codes 
for any of the associated source detections indicate that 
multiple sources are present in the source region or that 
the source region overlaps another source region, or if 
there are any source detections that have "ambiguous" 
linkages to the current master source. 

The master extent and variability flags are set if the 
corresponding codes for any uniquely matched source de- 
tections indicate that the source is extended or variable 
(as appropriate) in any energy band. The remaining mas- 
ter source flags are set only if the corresponding Source 
Observations Table flags are set for all uniquely associ- 
ated source detections, indicating that the corresponding 
warning criteria are violated in all observations of the 
source. 

3.14. Quality Assurance 

The scientific integrity of the CSC is guaranteed 
through a set of quality assurance steps that are per- 
formed as part of the catalog construction process 
(|Evans et al. 2008). Many of these analyses are executed 
automatically at the completion of each stage of catalog 
pipeline processing, so that any issues can be identified 
and corrected before they can affect downstream pro- 



Configuration 


Livetime (ks) 


False Source Rate 


ACIS-012367 


9 


0.0 


ACIS-235678 


10 


0.02 


ACIS-012367 


29 


0.0 


ACIS-235678 


30 


0.12 a 


ACIS-235678 


51 


0.21 


ACIS-012367 


68 


0.22 


ACIS-235678 


118 


1.2 


ACIS-012367 


125 


1.28 



a For this set of simulations, background data 
for CCD S4 (ACIS-8) were unavailable; the false 
source rate was renormalized to account for the 
missing chip data. 

cessing. These mechanisms detect pipeline processing 
errors, and identify potential data quality issues by com- 
paring key diagnostic output products with predefined 
standards. Each standard that is violated will cither trig- 
ger a human review to determine how to proceed, or will 
initiate one or more automated actions. The latter typi- 
cally result in termination of the processing thread for a 
subset of the input data. 

The vast majority of violations that occur because of 
data quality issues address the reality of detected sources, 
and are typically resolved without human intervention. 
Following the source detection step, detected source re- 
gions that are either significantly smaller than the di- 
mensions of the local PSF or significantly larger than 
the maximum expected source size, or which exceed a 
maximum ellipticity threshold, are deemed to be arti- 
facts, and the processing thread for the source region is 
terminated immediately to avoid evaluating source prop- 
erties unnecessarily. Sources that have too few counts, 
or that have a detection significance that is too low to 
pass the catalog SNR threshold are similarly discarded. 

The cores of sources observed with ACIS that are 
sufficiently bright can be eroded by photon pile-up. 
The source detection algorithm incorrectly detects bright 
spots on the ring surrounding the dark center of the im- 
age as distinct sources. Saturated sources are identified 
using a sliding matched filter algorithm. The source de- 
tections are manually adjusted so that a single source 
centered on the crater is included in the Source Observa- 
tions Table for the source, and the source properties are 
flagged as having been manually modified, so that the 
user can exclude such sources if they so wish. The tab- 
ulated source position errors are unreliable for sources 
whose regions have been manually modified. 

A more detailed comparison of the source dimensions 
with the local PSF is performed after the source prop- 
erties are computed, to identify sources that are statis- 
tically smaller than the PSF in all energy bands. The 
fraction of the local PSF that is included within the 
modified source region aperture (as defined in § 3.4.1) 
must be sufficiently large that the source location and 
aperture photometry can be computed from the fraction 
of the aperture that is not contaminated by overlapping 
sources. Finally, the SNR of the source is evaluated and 
compared with the minimum required for inclusion in the 
catalog. 

Human review is primarily required to address the oc- 
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Fig. 23. — False source rate as a function of flux significance for a simulated 125 ks observation. The maximum flux significance across 
all science energy bands is plotted. Left: BI-FI Chip Sources are those whose source regions dither across the CCD S2 (ACIS-6; back- 
illuminated)— S3 (ACIS-7; front-illuminated) boundary. Single Chip Sources are those whose source regions are completely contained on 
only a single chip. Right: Chip Edge Sources are those whose source regions dither off a chip edge during the observation. Non-Edge 
Sources are those whose source and background regions do not dither off a chip edge. 



currence of unexpected pipeline warnings or errors. Al- 
though an automated process performs the laborious task 
of scanning the log files associated with each processing 
pipeline to identify problems, the wide diversity of possi- 
ble error conditions require human intelligence to assess 
the reason for the failure and determine how to proceed. 
The typical response is to terminate the current process- 
ing thread, perform any needed repairs, and initiate re- 
processing of the thread. 

Other conditions that trigger a human review are pre- 
cautionary in nature, and include cases where the local 
spatial density of detected sources is too high, or the total 
number of sources detected in the field of view exceeds 
a predefined threshold. Although these conditions most 
likely arise because of field crowding, they could indicate 
an error in the source detection process. Errors that gen- 
erate a large number of sources would require substantial 
cleanup if processing was allowed to continue incorrectly. 

4. STATISTICAL PROPERTIES 

A detailed characterization of the statistical properties 
of the CSC is beyond the scope of this paper, but is the 
subject of a comprehensive discussion by F. A. Primini 
et al. (2010, in preparation). Here, we merely present 
a summary of the principal statistical properties of the 
catalog for reference. 

Statistical characterization of catalog source properties 
is accomplished primarily by using simulated and empty 
field (blank-sky) observations, together with datasets 
consisting of empty fields that have simulated sources 
with known properties added. These datasets are pro- 
cessed by the catalog pipelines in the same manner as 
real observations. 

4.1. False Source Rate 

To estimate false source rates, a series of blank-sky sim- 
ulations with exposure times of ~ 10, 30, 60, and 120 ks 
were constructed for typical ACIS imaging CCD config- 
urations. For each simulation, a template background 
event list for each active CCD was used to define the 
overall spatial variation of the background, and the total 
number of background events w as determined from the 
nomi nal field background rates {(Chandra X-rav Center! 
2009) and the simulated exposure time. For all CCDs 



except chip S4 (ACIS-8) the template background event 
lists recorded in the instrumental calibration database 
were used. Chip S4 is significantly affected by a variable 
pattern of linear streaks that appear to be caused by a 
flaw in the serial readout which randomly deposits sig- 
nificant a mounts of ch arge along pixel rows as they are 
read out (|Houckll2000[ ). Because of this issue, no ade- 
quate template is available for chip S4, and so one was 
constructed by combining several CSC event lists that 
do not include bright sources on that CCD. Each sim- 
ulated blank-sky event list was then processed through 
the CSC pipeline source detection steps. The false source 
rates derived from these simulations are reported in Ta- 
ble 9. From these data we derive a simple linear relation 
for the number of false sources per field as a function of 
livetime, namely 

log(i? fs ) = -3.345 + 1.6 x log(tH V e), 

where Rf s is the false source rate, and tii ve is the exposure 
livetime in units of ks. Using this relation, we estimate 
that ~ 370 sources (~ 0.4%) included in the catalog are 
spurious. 

As can be seen from the table, the false source rate is 
appreciable only for exposures longer than ~ 50 ks. There 
is some evidence for a clustering of false source detections 
near chip edges and at the boundaries between the back- 
and front-illuminated CCDs. This should not be surpris- 
ing since the low spatial frequency background is poorly 
constrained or changing rapidly in these locations. To in- 
vestigate these effects further, the false source rates near 
the chip edges and interfaces were examined separately 
for the longest simulated exposures. Figure 23 demon- 
strates that false source rates are enhanced in these re- 
gions for the 125 ks simulation. 

4.2. Source Detection Efficiency 

Source detection efficiency is characterized using point 
source simulations. A spatially random distribution of 
point sources is added to the blank sky sim ulations de- 
scrib ed above using the MARX simulator (Wis e et all 
l2003f ) to generate the incident X-ray photons. Sepa- 
rate simulations were generated for non-thermal sources 
with a power- law spectral distribution Fe <xE , and 
for thermal black-body sources with temperature kT = 
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Fig. 24. — Cumulative detection efficiency estimates from ACIS-012367 simulations of point sources with absorbed power-law spectral 
distributions (r = 1.7, iVjj = 3 X 10 20 cm - 2 ). Simulated source fluxes were drawn from a power-law N > S distribution with index 1.5 
and an overall normalization adjusted to yield a few hundred detectable sources per simulation. Thirty simulations per exposure time 
were calculated. Detection efficiency is computed by comparing the measured and input N > S distributions in the ACIS broad energy 
band. The ratio of these two distributions represents the fraction of input sources of a given incident flux that are actually detected. 
Left: Detection efficiency for a ~9ks exposure. The solid, dashed, and dotted lines plot the detection efficiencies for sources with < 5', 
5' < 8 < 10', and 10' < 6 < 15', respectively. Because of the power-law N > S distribution, relatively few bright sources were simulated 
and the plotted detection efficiency does not smoothly approach 100% for bright sources because of small number statistics. Right: As for 
the left panel, except for a ~125ks exposure time. 



1.0 keV, spectra. A neutral Hydrogen absorbing column 
TVh = 3 x 10 20 cm~ 2 was assumed for all sources. Source 
fluxes were drawn from a power-law N > S distribution 
with index 1.5. The overall normalization was adjusted 
to yield a few hundred detectable sources per simulation, 
a compromise aimed at reducing source confusion while 
limiting the total number of simulations required to ob- 
tain go od statistics. The effects of photon pile-up (jDavisI 
l2001bf ) and observation-specific bad pixels were included 
by post-processing each simulation with marxpileup and 
acis_process_events, respectively. The source events 
from the MARX simulations were then merged with the 
appropriate simulated blank-sky event lists, keeping only 
MARX-simulated source events that fell on active CCDs 
for the observation. As with the blank-sky simulations, 
simulated event lists were then processed through the 
CSC pipeline source detection and source properties ex- 
traction steps, and the resulting sources that would have 
been included in the catalog were tabulated. Finally, 
these sources were cross-referenced with the input source 
lists to allow a source-by-source comparison of input and 
derived properties. 

Source detection efficiency is determined by compar- 
ing the measured N > S and input N > S distribu- 
tions. The ratio of these two distributions represents the 
fraction of input sources of a given incident flux that 
are actually detected. Results of the comparison for the 
ACIS broad energy band detections from the shortest 
and longest ACIS-012367 power-law spectral distribution 
simulation sets are shown in Figure 24. The standard 
CSC processing pipeline further combines ACIS source 
detections from the broad, soft, medium, and hard en- 
ergy bands to construct the final detected source list. 
This step was not performed as part of these simulations. 
However, since the simulated source spectra are homoge- 
nous and well detected in the broad energy band, the 
difference is not significant in this case. 



4.3. Absolute Astrometric Accuracy 

As mentioned in § 3.5.1, the absolute astrometric ac- 
curacy of release 1 of the CSC was evaluated post facto 
by cross-matching catalog sources with their co unter- 
parts from the SDSS DR7 (jAbazaiian et al J 12009). Like 
the CSC, the SDSS DR7 is referenced to the Inter- 
natio nal Celestial Reference System (ICRS; lArias et al.l 
1995), and has statistical positional uncertainties ~ 
45 milliarcseconds (mas) rms per coor dinate for bright 
stars , with systematic errors < 20 mas (jAbazajian et al.l 
l2009h . Only CSC-SDSS source pairs with more than 90% 
match probability, evaluated accordi ng to the Bayesian 
proba bilistic formalism described by Budavari fc Szalavl 
(2008), were evaluated, resulting in 6,310 source pairs 
associated with 9,476 sources detected in individual ob- 
servations. Full details of the analysis and results are 
presented bv lRotsI (|2009f ) and F. A. Primini et al. (2010, 
in preparation). Here we summarize the main result. 

For each matching CSC-SDSS source pair, the sepa- 
ration, p, and the total 1 a positon error are computed, 
summing in quadrature the CSC and SDSS errors (and 
remembering that CSC position errors are reported as 
95% uncertainties). We then examine the value of re- 
duced x 2 = J2(p/ a tot) 2 /{n - 1) for bins in a to t covering 
the range ~ 0.1—2". The value of the reduced x 2 is rea- 
sonably close to 1, except for atot ;$ 0.3" (indicating that 
the errors are underestimated in that range). Adding a 
systematic astrometric error component of 0"16±0"0.01 
to <7tot yields reduced x 2 near 1 for all values of atot ■ We 
therefore adopt that value as the systematic astrometric 
error present in release 1 of the CSC. 

The distribution of the normalized separations for the 
CSC-SDSS source match pairs is shown in the left panel 
of Figure 25, together with the theoretical Rayleigh dis- 
tribution for the same number of sources. The overall 
shape of the curve agrees with the Rayleigh distribution, 
although there is a slight deficit at high values of normal- 
ized separation, suggesting that the overall error may be 
overestimated for sources at large off-axis angles. In the 
right panel of Figure 25, we present the average CSC- 
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Fig. 25. — Left: Distribution of the normalized separations for matching CSC-SDSS source pairs (solid line). The theoretical Raylcigh 
distribution for the same number of sources is shown hatched. Right: Average CSC-SDSS source pair separation with 0'.'16 systematic 
astromctric error included, as a function of off-axis angle, 0. 



SDSS separation as a function of off-axis angle, with the 
systematic astrometric error included. The average CSC 
1 a positional error ranges from 0'.'2 on-axis to ~ 3'.'5 at 
— 14' off-axis. 

4.4. Photometric Accuracy 

To assess the accuracy of CSC source fluxes, the mea- 
sured source region aperture photon fluxes are com- 
pared with the input photon fluxes of the simulated 
point sources. Figure 26 presents the comparison of 
ACIS broad energy band photon fluxes for simulated 
sources with a power-law spectrum. Inspection of the 
figure reveals good agreement for sources with off-axis 
angles within 10' of the aimpoint. For sources beyond 
10', photon fluxes appear to be systematically overes- 
timated by a factor of ~ 2 for sources fainter than 
~3x 10 -6 photons cm~ 2 s _1 . 

The systematic error in the faint flux bins is more 
prominent in the ACIS soft energy band, in which the 
measured fluxes appear under-estimated for all simulated 
input flux levels. Further investigation of this effect will 
be reported by F. A. Primini et al. (2010, in preparation). 
Preliminary analysis suggests that the effect results from 
the use of a monochromatic exposure map (computed 
at the effective energy of the band) when determining 
source fluxes. Models based on this assumption repro- 
duce the general features of the apparent systematic er- 
rors, and for the assumed model power-law spectrum the 
error is ~ 10% in the broad, medium, and hard energy 
bands, ^20-30% in the soft energy band, and ^30% in 
the ultra-soft energy band. 

5. CONCLUSIONS 

The Chandra Source Catalog is a general purpose vir- 
tual X-ray astrophysics facility that provides access to a 
carefully crafted set of scientifically useful quantities for 
individual X-ray sources observed by the Chandra X-ray 
Observatory. The first release of the catalog was pub- 
lished to the astronomical community in March 2009, 
and includes source properties for 94,676 point and com- 
pact X-ray sources detected in a subset of public ACIS 
imaging observations from roughly the first eight years 
of the Chandra mission. This release of the catalog in- 
cludes sources with observed spatial extents are < 30", 
and whose flux estimates are at least 3 times their esti- 



mated 1 a uncertainties. Observations that include sub- 
stantially extended sources are not included in the first 
release of the catalog. For each X-ray source, the cat- 
alog tabulates about 60 distinct measured and derived 
source properties, generally with associated lower and 
upper confidence limits, in several energy bands. These 
properties are generally derived from all of the observa- 
tions in which a source is detected. However, in the first 
catalog release, multiple observations are not combined 
prior to source detection, so the depth of the catalog is 
limited by the duration of the longest single exposure 
of a field. The catalog further tabulates roughly 120 
observation-specific properties for each observation of a 
source, again with associated lower and upper confidence 
limits, and in several energy bands. 

Tabulated source properties include source position, 
spatial extent, multi-band aperture fluxes computed in 
several different ways, X-ray hardness ratios and spec- 
tral model fits, and intra- and inter-observation variabil- 
ity measures. In addition to these "traditional" catalog 
elements, for each source detection the catalog includes 
an extensive set of FITS format file-based data products 
that can be manipulated interactively by the user, includ- 
ing source images, event lists, light curves, and spectra 
from each observation in which a source is detected. 

Looking towards the future, release 1.1 of the catalog, 
scheduled for spring 2010, will include data from pub- 
lic HRC-I imaging observations and newly public ACIS 
imaging observations, but will otherwise retain the same 
limitations as release 1. In release 2, we plan to co-add 
multiple observations of the same field that use the same 
or similar instrument configurations, and that have sim- 
ilar spacecraft pointings (within ~ 30") prior to source 
detection, to achieve fainter limiting sensitivities in many 
fields. We anticipate that new algorithms will allow this 
release to have a significantly fainter source detection 
threshold than release 1. This release should also pro- 
vide limited improvements in the area of extended source 
handling (for example allowing for the inclusion of ex- 
posures containing moderately extended emission from 
galaxy cores up to ~ 60" spatial scale), as well as nu- 
merous algorithm enhancements that will refine field and 
source property calculations. 
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Fig. 26. — Comparison of input (Fo) and measured (F) ACIS broad (b) band fluxes for simulated sources with power-law spectra and 
off-axis angles 8 < 10' (left) and 8 > 10' (right). For each bin, the horizontal line indicates the median measured flux value. The boxes 
include 90% of the measurements in each bin, and the vertical lines indicate the extreme values. Bins colored red include fewer than 100 
measurements; bins colored blue include 100-400 measurements; bins colored black include more than 400 measurements. The green line 
has a slope of 1. 
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APPENDIX 
A. MASTER SOURCE MATCHING ALGORITHM 

The procedure, referenced in § 3.4.2, for matching source detections from multiple observations that overlap the 
same region of the sky, is described here. These steps must be executed for each observation, but the outcome does 
not depend on the order in which the observations are processed. 

The algorithm defines the overlap ellipse of a source detection to be the PSF 90% ECF aperture in the energy band 
that has the highest number of aperture source counts. The following two assumptions are made: (a) if a source 
detection in one observation is resolved into multiple source detections in a second observation, then the overlap ellipse 
corresponding to the former detection will overlap all of the overlap ellipses corresponding to the latter detections, 
and (b) multiple source detections in a single observation correspond to distinct sources on the sky, even if the overlap 
ellipses intersect spatially. 

For the set S of source detections identified in the current observation, the following 9 steps are performed. 

(1) Identify the sets Mi, Mj, ... of candidate matching source detections in observations i,j, . . . that overlap the 
current observation. Candidate matching source detections are those source detections that have radial separations on 
the sky from any member of the set S that are smaller than some predefined radius, r. For convenience, we designate 
the union Mi U M 3 U ... as the set M. 

(2) Compute the overlap ellipses, defined above, for each member of the sets S and M. 

Two source detections a and b in observations 1 and 2, respectively, are deemed to overlap if and only if 

A[anb}/A[a] > 0.15 or A[a n b]/A[b] > 0.15, 

where A [a] is the area of the overlap ellipse of source detection a, and A[an6] is the area of the spatial intersection of 
the overlap ellipses of a and b. This has the effect of dismissing very small overlaps. 
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Fig. 27. — Example "confused match" source detections. Source detection a from observation 1 , overlaps source detections b\ and c 
from observations 2 and 3, respectively. Source detection c also comprises a partial unambiguous match with source detection &2 from 
observation 2 . The latter connection implies that source detections a and c are confused matches with the pair of sources b\ and 62 from 
observation 2. 



In addition, if a source detection a in observation 1 overlaps multiple source detections b\ , bi , . . . , b n (n > 1 ) in 
observation 2, and 

A[anb t }/A[a} > 0.67 and A[a n bj]/A[a H bi] < 0.33 

for all j ^ i, then only the overlap between source detections a and bi is recognized. The remaining source detections 
bj included in observation 2 arc deemed not to overlap source detection a. This has the effect of recognizing only a 
single dominant overlap and ignoring additional smaller overlaps from the same observation. 

(3) Compute the subset Af C S of source detections that do not overlap any member of the set M. of candidate 
matching source detections. Af comprises the set of source detections that must be added to the Master Sources Table 
as newly identified master sources. 

(4) Compute the set V of members of S U M. that comprise partial unambiguous matches. A source detection a 
included in observation 1 is a partial unambiguous match to a source detection b included in a different observation 2 if 
and only if (a) a overlaps 6, (b) a does not overlap any other source detection included in observation 2, and (c) b does 
not overlap any other source detection included in observation 1 . Sources included in V are uniquely matched between 
pairs of overlapping observations, but are not necessarily uniquely matched between all overlapping observations. 

(5) Compute the subset WCPof unambiguous matches. An unambiguous match between source detections a, b, c, . . . 
included in observations 1,2,3,... occurs when all pairs ([a, b], [a, c], [a, .. .],[b,c],[b, ...],. . .) of source detections in- 
dividually form partial unambiguous matches. U comprises the set of source detections that are uniquely matched to 
existing master sources in the Master Sources Table. 

Figure 15, Left is an example of an unambiguous match. 

(6) Compute the set C of members of S U M. that comprise confused matches. A confused match results when a 
source detection a included in observation 1 overlaps multiple source detections b,c, . . . that either (a) (i) are included 
in a single observation 2 and (ii) overlap no other source detection in observation 1 than a, or (b) consist of partial 
unambiguous matches such that there is at least one observation that is common amongst the partial unambiguous 
matches for all of the sources b,c, . . .. C comprises the set of confused source detections that must be flagged as confused, 
and linked ambiguously to the corresponding master sources in the Master Sources Table. 

An example of case (a) above is shown in Figure 15, Center. In the case (b) above, note that source detection a is 
not required to overlap all of the individual source detections that comprise each of the partial unambiguous matches 
(e.g., Fig. 27). 

(7) Once the set C of confused matches is determined, steps (2)-(5) should be re-applied to the set {SUAi }\{AfUUUC} 
to identify additional members of the sets Af and U that were previously missed because they were overlapped by one 
or more confused source detections. 

(8) After all members of the sets Af and U have been identified, re-examine members of the set C to verify that 
the overlaps of source detections that caused each member to become assigned to set C are overlaps with members of 
Af and U. If any source detection which is not a member of Af or U overlaps a member of C, and that overlap was 
relied on to assign the member to C, then remove the member from set C. The removed member will revert to an 
uncategorized source detection. 

(9) At this point, the set {S U jvl}\{jVuW U C} consists of source detections that cannot be merged or linked to 
master sources using the above rules. These source detections typically (but not exclusively) overlap at least two other 
sources that were observed in different observations and that do not overlap each other. We designate members of this 
set H as human-review matches. Manual review is required to disambiguate the source matches. 

Figure 15, Right is an example of a human- review match. 
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B. COMBINING SOURCE POSITIONS 

As described in § 3.5.2, a multivariate optimal weighting formalism is used to improve the estimates of the position 
and positional uncertainty of each X-ray source by combining the statistically independent source detections included 
in the set of individual observations. The source position error uncertainties are expresse d in the form of error ellipses 
centered upon the estimated source positions. Details of the derivations can be found in iDavisI (|2007b[ ). 

The improved estimates of the source position, X, and associated covariance matrix, a 2 , are 



X = a 2 Y^;a 2 



^rr 2 



(Bl) 



where X^ represents the ith esti mate of the mean of the two-dimensional source position, and of denotes the 2x2 
covariance matrix, equation (|B7|) below, associated with this estimate. 

Before the covariance matrix o 2 can be computed, the individual error ellipses must be mapped from the celestial 
sphere onto a common tangent plane. The ith estimate of the source position is specified as a confidence-ellipse 
centered upon the celestial coordinates (ckj, Si), with the major axis of the ellipse making an angle 9i (— 7r < 9i < ■k) 
with respect to the local line of declination at the center of the ellipse. The celestial coordinates (on, Si) correspond to 
a unit vector 

pi = x cos on cos Si + y sin on cos Si + z sin Si 

on the celestial sphere, where (x, y, z) are orthonormal basis vectors oriented such that x points to the origin of right 
ascension on the celestial equator, z points to the North celestial pole, and y completes the right-hand Cartesian 
system. 

The common tangent plane is constructed on the celestial sphere at the position p , which is taken to be the 
arithmetic mean of the ellipse centers p^. 



Pa = y^ji/ 



E 



Pi 



(B2) 



The tangent plane coordinates (xi, yi) corresponding to (pn,8i) are 

Xi = (fii ■ e x )/(pi -p ) 

Vi = (Pi-ey)/(pi-p ), (B3) 

where e x and e y are orthonormal basis vectors parallel to the local lines of right ascension and declination at po, i.e., 

e x = —x sin an, + V cos an. 

e y — —x sin So cos an, — y sin So sin an, + I cos So , 

where (ao, ^o) are the celestial coordinates that correspond to po- 

Similarly, the unit vectors on the celestial sphere corresponding to the end-point positions of the semi-minor and 
semi-major axes of the ellipse are given by 

Pf mm ' =P l cos C in ° r + ol % sin (/>~ cos 6 Z - Si sin (/>~ sin 9 % 

-major - /major , - • /major • /i , c • /major r, f-n a\ 

p i ' = pi cos <pi +«iSin^ : J smVi + OiSiiKpi J cos Oi, (B4) 

where 0™ lnor and </>™ ajor are the arc-lengths of the semi-minor and semi-major axes, respectively, and on and Si are 
unit vectors that point along the directions of increasing right ascension and declination, respectively, at the position 

Pi- 

The lengths of the semi-minor and semi-major axes on the tangent plane are given by 



'i,i= VW in ° r - x i? + (2A min ° r - Vi? 



*' -^« ajor -^) 2 + (yr jor -2/,) 2 , (B5) 



respectively, where we have denoted the tangent plane coordinates of p mmor and p™ ajc " as (a;f llnor , yf 111101 ) and (a;™ ajor , 
y™ ajor ), respectively. The angle that the semi-major axis makes with respect to the local line of declination is 

(B6) 



maior / 

\Vi -y^/ 

Armed with the projections of the individual error ellipses projected on the common tangent plane, equations 
(|B3 |) (|B6|) . covariance matrices can be computed as follows. 

The three parameters that specify the geometry of each projected error ellipse are the lengths of the semi-major and 
semi-minor axes, and the position angle i? that the major axis of the ellipse makes with respect to the tangent plane 
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y axis. The semi-major and semi-minor axis lengths correspond to the 1 a confidence intervals along these axes. In a 
basis whose origin is at the center of the ellipse, and whose y axis is along the major axis of the ellipse, the covariance 
matrix is 



< 



where a[ ^ and a' 2 ^ are the ler confidence values along the minor axis and major axis of the ellipse, respectively 
(°~'i i > °i j)- The form of the covariance matrix in the unrotated system is 



a? = 



_ ( a'i ti cos 2 di + a'^.i sin 2 di {af^ — af^) cos di sin di 
{a'^i — (j'ii) cos $i sin di a^ i sin 2 di + af t cos 2 di 



(B7) 



where di is the angle that the major axis of the ellipse makes with respect to the tangent plane y axis. 

At this point, equation (|B7|) can be used to compute the covariance matrices from the len gths of the semi-minor 
and semi-major axes of the source position error ellipses in the tangent plane, equations (|B5[) . The error ellipses for 
the individual source observations are then combined using equation (|B1[) to compute the optimally weighted source 
position and position error ellipses on the tangent plane for the combined set of observations. The mapping of the 
optimally weighted er ror e ll ipse f rom the ta nge nt plane to the celestial sphere can be performed using the inverse 
relations of equations (|B3| . (|B4|) . (jB"5l) . and ([B6]) . 
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